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MULTI-PAIR GIGABIT ETHERNET TRANSCEIVER 5 

CROSS REFERENCE TO RELATED APPUCAnONS | 

The present application claims priority on the basis of the following provisional 
^plications: Serial Number 60/130,616 entiUed "Multi-Pair Gigabit Eftemet # 
Transceiver" filed on April 22, 1999, Serial Number 60/1 16,946 entitled "Multiple 
Decision Feedback Equalizer" filed on January 20, 1999, Serial Number 60/108i319 
entitled "Gigabit Ethernet Transceiver" filed on November 13, 1998, Serial Nuin^ 
60/107,874 entitled "Apparatus for and Method of Distributii^ Clock Signals in ai 
Communication System" filed November 9, 1998, and Serial Number 60/107,880'entitled 
"Apparatus for and Method of Reducing Power Dissipation in a Communication |ystem" 
filed November 9, 1998. 

The present q)plication is related to the following co-pending application^? 
commonly owned by tiie assignee of flie; present application, tiie entire contents of each of 
which are expressly incorporated herem by reference: Serial Number 09/370,3 70^entitied 
"System and Mefliod for Trellis Decoding in a Multi-Pair Transceiver System", Serial 
Number 09/370,353 entitied "Multi-Pair Transceiver Decoder System with Lovs0 
Computation Sheer", Serial Number 09/370,354 entitied "System and Metiiod fbl High 
Speed Decoding and ISI Compensation in a Multi-Pah: Transceiver System" Seri^ 
Number 09/370,491 entitied "High-Speed Decoder for Multi-Pair Gigabit Transbiver". 
all filed October 10, 1999, and Serial Number 09/390,856 entitied Dynamic reguition of 
Power Consumption in a High-Speed Communication System" filed September 3| 1999. 

The present application is also related to flie following co-pending q)plications, 
filed oh mstant date hoBwith and commonly owned by the. assignee of the presei^f 
application, the entire contents of each of whidi are expressly mcorporated herein by 

reference: Serial Number entitied "Clock Generation and Disteibution in an 

Efliemet Transceiver" and Serial Number ■ entitied "Switching Noise R^iuction 

in a Multi-Clodc Domain Transceiver". -11. 
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FIELD OF THE INVENTION 

Hie present invention relates generally to high speed networking transceivers and, 
more particularly to gigabit Ethernet transceivers having reduced power consumption, 
eflBcicnt clock domain partitioning and able to decode input symbols within a symbol 
period with a minimum of computational intensity. 

DESCRIPHON OF THE RELATED ART 

In recent years, local area networic (LAN) applications have become more and more 
prevalent as a means for providing local interconnect between personal computer systems, 
work stations and servers. Because of the breadth of its installed base, the lOBASB-T 
inq)lemcntation of Ethernet remains flie most pervasive if not Ac dominant, network 
technology for LANs. However, as the need to exchange mformation becomes more and 
more imperative, and as the scope and size of the mformation being exchanged increases, 
higher and higher speeds (greater bandwidth) are required fiom network interconnect 
technologies. Among the highspeed LAN technologies currentiy avaUable, fest Ethernet, 
conunonly tenned 100BASE-T, has emeiged as the clear technological choice. Fast 
Ethernet technology provides a smooth, non-disruptive evolution from the 10 megabit per 
second (Mbps) performance of lOBASE-T applications to the 100 Mbps performance of 
100BASE-T. The growing use of 100BASE-T interconnections between servers and 
desktops is creating a definite need for an even higher speed network technology at the 
backbone and server level. 

One of die more suitable solutions to this need has been proposed in the IEEE 
802.3ab standard for gigabit Etiiemet, also termed lOOOBASE-T. Gigabit Ethernet is 
defined as able to provide 1 gigabit per second (Gbps) bandwidth in combination with the 
simplicity of an Ediemet architecture, at a lower cost than other technologies of 
comparable speed. Moreover, gigabit Ethernet offers a smooth, seamless upgrade path for 
present lOBASE-T or 100BASE-T Ethernet installations. 

In order to obtain the requisite gigabit perfonnance levels, gigabit Ethernet 
transceivers arc interconnected with a muiti^pair transmission channel architecture. In 
particular, transceivers are interconnected usmg four separate pairs of twisted Categoiy-5 
copper wires. Gigabit communication, in practice, involves the simultaneous, parallel 
transmission of information signals, widi each signal convoking information at a rate of 
250 megabits per second (Mh^s). Simultaneous, parallel transmission of four information 
signals ovct four twisted wire pairs poses substantial diallraiges to bidirectional 
communication transceivers, even though the data rate on any one wire pair is "only" 250 
Mbps. 

In particular, flie gigabit Ethernet standard requires that digital information being 
processed for transmission be symbolically represented in accorxlance wiA a five-level 
pulse amplitude modulation scheme (PAM-5) and encoded in accordance with an 8-state 
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Trellis coding methodology. Coded information is then communicated over a multi- 
dimensional parallel transmission channel to a designated receiver, vt^erc the original 
information must be extracted (demodulated) from a multi-level signal. In gigabit 

^. Ethersiet, it is important to note Aat it is the concat^iation of signal samples received 

simultaneously on all four twisted pair lines of the channel that defines a symbol. Thus, 
demodulator/decoder ardiitectures must be implemented with a degree of computational 
complexity that allows them to accommodate not only the "state width" of Trellis coded 
signals, but also the "dimensional depth" represented by the transmission charmel. 

IQ Computational complexity is not the only diaUenge presented to modem gigabit 

capable communication devices. A perhaps greater challenge is that the complex 
computations required to process "deep" and "wide" signal representations must be 
performed in an almost vanishingly small period of time. For example, in gigabit 
plications, each of the four-dimensional signal samples, formed by the four signals 

j5 received simultaneously over the four twisted wire pairs, must be eflBciently decoded 
within a particular allocated symbol time wiiidow of about 8 rianoseconds. 

Successfully accomplishing the multitude of sequential processing operations 
required to decode gigabit signal samples within an 8 nanosecond window requires that 
the switching cq)abilities of tiie integrated circuit technology from vMch the transceiver 

2Q is constructed be pushed to almost its fimdamental limits. If performed in conventional 
feshion, sequential signal processing op^ations necessary for signal decoding and 
demodulation would result in a propagation delay through the logic circuits that would 
exceed the clock period, rendering the transceiver circuit non-fimctional. Fundamentally, 
then, the challenge imposed by timing constraints must be addressed if gigabit Ethernet is 

25 to retain its viability and achieve the same reputation for accurate and robust operation 
enjoyed by its lOBASE-T and 100BASE-T siblings. 

In addition to the challenges imposed by decoding and demodulating multilevel 
signal samples, transceiver systems must also be able to deal with intersymbol interference 
(ISI) introduced by transmission channel artifacts as well as by modulation and pulse 

3Q shaping components in the transmission path of a remote transceiver system. During die 
demodidation and decoding process of Trellis coded information^ ISI components are 
introduced by either means must also be considered and compensated, further ejqianding 
the computational con:q}Iexity and flius, system latency of the transceiver system. 
Without a transceiver system cqjable of efBcient, high-speed signal decoding as well as 

35 simultaneous ISI compensation, gigabit Ethernet would likely not remain a viable concept 
In a Gigabit Ethernet commimication system that conforms to the lOOOBASE-T 
standard, gigabit transceivers are connected via Category 5 twisted pairs of copper cables. 
Cable r^ponses vary drastically among different cables. Thus, die computations, and 
h^ce power comsumption, required to compensate for noise (such as edio, near-oid 
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crosstalk, fer-end crosstalk) wUl vaiy widely depending on the particular cable that is 
used. 

In integrated circiiit technology, power consumption is generaUy recognized as 
being a function of the switching (clock) speed of transistor elements making up the 
circuitry, as weU as the number of component elements operating within a given time 
period. The more transistor elements operating at one time, and the higher the 
operational speed of tiie component circuitry, the higher the relative degree of power 
consumption for that circuit. This is particulariy relevant in the case of Gigabit Ethernet, 
since all computational circuits are clocked at 125 Mhz (corresponding to 250 Mbps per 
tvvisted pair of cable), and the processing requirements of such circuits require rather large 
blocks of computational circuitry, particularly in the filter elements. Power consumption 
figures in the range of fiom about 4 J Watts to about 6.0 Watts are not unreasonable when 
the speed and complexity of modem gigabit communication circuitry is considered. 

Pertinent to an analysis of power consumption is the realization tiiat power is 
dissipated, in integrated circuits, as heat As power consumption increases, not only must 
the system be provided with a more robust power supply, but also wifli enhanced heat 
dissipation schemes, such as heat sinks (dissipation fins coupled to die IC package), 
cooling fims, increased interior volume for enhanced air flow, and the like. AU of tiiese 
dissipation schemes involve considerable additional manufecturing costs and an extended 
design cycle due to tite need to plan for thermal considerations. 

Prior high speed commuriication circuits have not adequately addressed tiiese 
fliermal considerations, because of tiie primary necessity of accommodating high data 
rates witii a sufficient level of signal quality. Prior devices have, in effect, "hard wired" 
tiieir processing capabiUty, such tiiat processing circuihy is always operative to maximize 
signal quality, whetiier tiiat degree of processing is required or not Where channel quality 
is high, full-filter-tap signal processing more often obeys tiie law of diminishing returns, 
witii very smaU incremental noise margin gains recovered ftom tiie use of additional large 
Modes of active filter circuitry. 

This tiade-oflfbetweenpowa- consumption and signal quality has heretofore 
limited tiie options available to an integrated circuit communication system designer. If 
low power consumption is made a system requirement, tiie system typicaUy exhibits poor 
noise margin or bitrcrror-rate perfimnance. Conversely, if system performance is made 
tiie primary requirement, power consumption must faU where it may wifli tiie 
corresponding consequences to system cost and reliability. 

Accordingly, tiiere is a need for a high speed integrated circuit communication 
system design which is able to accomodate a wide variety of worst-case channel (cable) 
responses, wfaUe adaptively evaluating signal quality metiics in order tiiat processing 
circuihy might be disabled, and power consumption might tiiereby be reduced, at any such 
time tiiat flie circuitry is not necessary to assure a given minimum level of signal quality. 
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Such a system shoiild be able to adaptively determine and achieve the highest level 
of signal quality consistent witti a given maximum power consumption specification. In 
addition, such a system should be able to adaptively detemime and achieve the lowest 
level of power consumption consistent with a given minimum signal quality specificatiou. 

SUMMARY OF TEffi INVENTION 

The present invention is a method and a system for providing an input signal fix)m a 
multiple decision feedback equalizer to a decoder based on a tail value and a subset of 
coefficient values received fiom a decision-feedback equalizer. A set of pre-computed 
values based on the subset of coefficient values is generated. Each of the prc-computed 
values is combined with the tail value to generate a tentati ve sample. One of the tentative 
samples is selected as the input signal to the decoder. 

In one aspect of flic system, tentative samples are saturated and then stored in a set 
of registers before bemg ou^utted to a multiplexer vMch selects one of the tentative 
samples as the mput signal to the decoder. This operation of storing the tentative samples 
in the registers before providing the tentative samples to the multiplexer fecilitates high- 
speed operation by breaking up a critical path of computations into substantially balanced 
first and second portions, the first portion includmg computations in the decision-feedback 
equalizer and the multiple decision feedback equalizer, the second portion including 
computations in the decoder. 

The present invention can be directed to a system and method for decoding and ISI 
compensating received signal samples, modulated for transmission in accordance with a 
multi-level alphabet, and encoded in accordance with a multi-state encoding scheme. 
Modulated and encoded signal samples are received and decoded in an integrated circuit 
receiver vAnch includes a multi-state signal decoder. The multi-state signal decoder 
includes a symbol decoder adapted to receive a set of signal samples representing multi- 
state signals and evaluate the multi-state signals in accordance with the multi-level 
modulation alphabet and the multi-state encoding scheme. The symbol decoder outputs 
tentative decisions. 

An ISI compensation circuit is configured to provide ISI compensated signal 
samples to the symbol decoder. The ISI compensation circuit is constmcted of a smgle 
decision feedback equalizer, with the single decision feedback equalizer providing ISI 
compensated signal samples to the ^mbol decoder based on tmtative decisions outputted 
by the symbol decoder. 

In one aspect of the invention, a path memory module is coupled to the symbol 
decoder and receives decisions and error terms from the symbol decoder. The paA 
mcmoiy module includes a plurality of sequential regist^s, with each corresponding to a 
respective one of consecutive time intervals. The rcgista:s store decisions corresponding 
to the respective ones of the states of the multi-state encoded signals. Decision circuitry 
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selects a best decision from corresponding ones of the registers, with the best decision of a 
distal register defining a final decision. The best decision of an intermediate register 
defines a tentative decision which is pu^ut to the ISI compensation ciixniit 

5 The single decision feedback equalizer is configured as an FIR filter, and is • 

characterized by a multiplicity of coefficients, subdivided into a set of high-order \ 
coefficients and a set of low-order coefficients. Tentative decisions fiom the path memory 
module arc forced to the single decision feedback equalizer at various locations along the 
filter delay line and are combined wiA the high-order coefficients in order to define a 

10 partial ISI component The partial ISI component is arithmetically combined witfi an 
input signal sample in order to gttierate a partially ISI compensated intermediate signal 
called tail signal. 

Low-order coefficioits fiom the single decision feedback equalizer are directed to a 
convolution engine wherein they are combined with values representing the levels of a 

15 multi-level modulation alphabet The convolution engine outputs a multiplicity of signals, 
representing the convolution results, each of which are arithmetically combined with the 
tail signal to define a set of ISI compensated tentati ve signal samples. 

In a particular aspect of the invention, the ISI compensated tentative signal sinples 
are saturated and then stored m a set of registas before being ouQiutted to a multiplexer 

20 circuit which selects one of the tentative signal samples as the input signal to the symbol 
decoder. Storing tentative signal samples in the set of registers before providing the 
tentative signal samples to the multiplexer, fecUitates high-speed operation by breaking up 
a critical path of computations into substantially balanced first and second portions, flie 
first portion including computation in the ISI compensation circuitry, including the single 

25 decision feedback equalizer and the multiple decision feedback equalizer, the second . 
portion including computations in the symbol decoder. 

In a fiirther aspect of the present invraition, symbol decoder circuitry is 
in^lemented as a Viterbi decoder, the Viterbi decoder computing path metrics for each of 
the N states of a Trellis code, and outputting decisions based on the path metrics. A path 

30 memory module is coupled to the Viterbi decoder for receiving decisions. The path 
memory module is implemented with a number of depth levels corresponding to > 
consecutive time intervals. Each of the depth levels inchides N registers for storing; \ 
dedsions corresponding to die N states of the trellis code. Each of the depth levels further 
includes a muMplexer for selecting a best decision from the corresponding N registers, the 

35 best decision at the last depth level defining the final decision, the best decisions at other 
selected depth levels defining tentative decisions. 

In a particular aspect of the invention, tentative decisions are generated from die 
first tiiree dqjfli levels of the pafli memory module, Hiese tentative decisions are forced 
to a single decision feedback equalizer to generate a partial ISI component based oh tiie 
first tiuee tentative decisions and a set of high-order coefficients. The partial ISI • 
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component is arithmeticaUy combined with an input signal sample in order to define a 
partially ISI compensated tentative signal sample. 

The first two coefficients of fixe single decision feedback equalizer are linearly 
combined with values representing the five levels of a PAM-5 symbol alphabet, thereby 
generating a set of 25 pre-computed values, each of which are arithmeticaUy combined 
with the partial ISI compensated signal sample to develop a set of 25 samples, one of 
which is a fiiUy ISI compensated signal sample and is chosen as the input to the symbol 
decoder. 

The present invention is fijrther directed to a system and method for decoding 
information signals modulated in accordance wifli a multi-level modulation scheme and 

encoded in accordance with a multi-state encoding scheme by computing a distance 
between a received word from a codeword included in a plurality of code-subsets. 
Codewords are formed fiiom a concatenation of symbols from a multi-level alphabet, with 
the symbols selected from two disjoint symbol-subsets X and Y. A received word is 
represented by L inputs, with L representing the number of dimensions of a multi- 
dimensional communication channel. Each of the L inputs uniquely corresponds to one of 
flie L dimensions. A set of 1-dimensional (ID) errors is produced from the L inputs, 
with each of the ID errors representing a distance metoic between a respective one of the L 
inputs and a symbol in one of tiie two disjoint symbol-subsets. ID errors are combined in 
order to produce a set of L-dimensional errors such that each of tiie L-dimensional errors 
represents a distance between tiie received word and a nearest codeword in one of tiie 
code-subsets. 

In one embodimrait of tiie invention, each of tiie L inputs is sliced witii respect to 
each of tiie two disjoint symbol-subsets X and Y in order to produce a set of X-based 
errors, a set of Y-based errors and corresponding sets of X-based and Y-based decisions. 
The sets of X-based and Y-based errors form tiie set of ID errors, whUe tiie sets of X- 
based and Y-based decisions form a set of ID decisions. Each of flie X-based and Y- 
based decisions corresponds to a synibol, in a corresponding symbol subset, closest in 
distance (value) to one of tiie L inputs. Each of tiie ID errors represents a distance metric 
between a corresponding ID decision and tiie respective one of tiie L inputs. 

In anotiier embodiment of flie invention, each of tiie L inputs are sliced wifli respect 
to each of tiie two disjoint symbol subsets X and Y in order to produce a set of ID 
decisions. Each of tiie L inputs is fimher sUced witii respect to a symbol-set including all 
of flie symbols of flie two disjoint symbol-subsets in order to produce a set of hard 
decisions. The X-based and Y-based ID decisions are combined witii a set of hard 
decisions in order to produce a set of ID errors, witfi each of flie ID errors representing a 
distance metric between a corresponding ID decision and a respective one of flie L inputs. 
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In one embodiment of the present invention, I -dimensional errors are combined in 
a first set of adders in order to produce a set of 2-dimensional orors. A second set of 
adders combines the 2-dimensional errors in order to produce tntemiediate L-dimensional 
5 errors, with the intermediate L-dimensional errors being arranged into pairs of errors such 

that the pairs of errors correspond one-to-one to the code-subsets. A mmimum-select 
module determines a miinim i ifm for each of the pairs of errors. Once determined, flie 
minim a are defined as the L-dimensionai errors. 

The preset invention is flwlher directed to a method for dynamically regulating 
-10 *® power consumption of a high-speed integrated circuit which includes a multiplicity of 
processing blocks. A first metric and a second metric, widdi are respectively related to a 
first perfoniiance parmeter and a second performance parameter of the integrated circ^ 
aredefined. The first metric is set at a pre-defined value. Selected blocks of the 
multiplicity of processing blocks are disabled in accordance with a set of pi^-determined 
15 patterns. The second metric is evaluated, while the disabling operation is being performed, 
to generate a range of values of the second metric. Each of the values corresponds to the 
pre-defined value of the first metric. A most desirable value of the second metric is 
determined &om the range of values and is matched to a corresponding pre-determined 
pattern. The integrated circuit is subsequently operated with selected processing bloc^ 
20 disabled in accordance with the matching pre-determined pattern. 

In particular, the first and second performance parameters are distinct and are 
chosen fiiom the parametric group consisting of power consumption and a signal quality 
figure of merit. The signal quality figure of merit is evaluated while selected blocks of the 
multiplicity ofprocessing blocks are disabled Thesetof selected blocks which give the 
25 lowest power xxinsumption, when disabled, while at the same time maintaining an 

acceptable signal quality figure of merit at a pre-defined threshold level is maintained in a 
disabled condition while the integrated circuit is subsequently operated. 

In one aspect of the present invention, reduced power dissipation is chosen as the 
most desirable metric to evaluate, vMle a signal quality figure of merit is accorded 
30 secondary consideration. Alternatively, a signal quality figure of merit is chosen as the 
most desirable metric to evaluate, vAiHc power dissipation is accorded a secondary 
consideration. In a finHier aspect of tfie present invention, both signal quality and powea- 
dissipation are accorded equal consideration with selective blocks of the multiplicity of 
processing blocks being disabled and the resultant signal quality and power dissipation 

35 figures ofmerit being evduated so as to define a coexisting local inaxima of signd 
quality witii a local minima of power dissipation. 

In one particular embodiment, the present invention moy be characterized as a 
method for dynamically regulating the power consumption of a conmnmication system 
vAdch includes at least a first module. Tlie first module can be any circuit block, not 
necessarily a signal processing block. Power regulation proceeds by specifying a power 
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dissipation value and an enor value. An infonnation error metric and a power metric is 
computed. Activation and deactivation of at least a portion of the first module of the 
communication system is controlled according to a particular criterion. The criterion is 
based on at least one of the information aror metric, the power metric, the specified error 
and the specified power, to regulate at least one of the information metric and the power 
metric. 

In particular, at least a portion of the first module is activated if the information 
esTor metric is greater than the specified error and the first module portion is deactivated if 
the infonnation error metric is less than the specified error. In an additional aspect of the 
invention, flie first module portion is activated if the mformation error metric is greater 
than the specified error and the power metric is smaller than flie specified power. The first 
module portion is deactivated if the information mor metric is smaDer than the specified 
error or the power metric is greater than the specified power. In yet a fiirther aspect of the 
invention, the first module portion is activated if the information error metric is greater 
than the specified error and is deactivated if tiie infonnation error metric is smaller than a 
target value, the target value being smaller than the specified error. In yet another aspect 
of the invention, the first module portion is activated if the information error metric is 
gr^er than the specified error and the powa- metric is smaller than the specified power. 
The first module portion is deactivated if the information error metric is smaller than a 
target value, the target value being smaller than the specified error, or the power metric is 
greater than the specified power. 

Advantageously, tiie infonnation error metric is related to a bit error rate of the 
conununication system and the information error metric is a measure of perfonnance 
degradation in the commimication system caxised by deactivation of the portion of the first 
module. Where the module is a filter which includes a set of taps, with each of the taps 
including a filter coefficient, the information error metric is a measure of performance 
degradation of a transceiv^ caused by operation of the filter. 

Power dissipation reduction is implemented by deactivating subsets of taps which 
make vsp the filter, until such time as perfonnance degradation caused by the truncated 
filter readies a pre-detennined threshold level. 

The preset invention fiirther provides a method for reducirig system performance 
degradation caused by switdiing noise in a system vAuch includes a set of subsystems. 
Each of the subsystems includes an analog section and a digital sectioiL Each of the 
analog sections operates in accordance with a corresponding one of a set of sampling 
clock signals which are synchronous in frequency. The digital sections operate in 
accordance with a receive clock signal. Hie receive clock signal is generated such that it 
is synchronous in fi:equency with the sampling clock signals and has a phase of&et with 
respect to one of the sarr^Iing clock signals. This phase offset is adjusted such that 
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system pafonnance degradation due to coupling of switching noise fixim the digital 
sections to the analog sections is substantially minimized. 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects and advantages of the present inveation will be 
more fiiUy understood when considered with respect to the following detailed description, 
appended claims and iaccompan)ring drawings, v\dia:ein: 

FIG. 1 is a simplified, semi-schematic block diagram of a high-speed bidirectional 
communication system exemplified by two transceivers configured to commxmicate over 
multiple twisted-pair wiring channels* 

na2 is a simplified, semi*schematic block diagram of a bidirectioiial 
communication transceiver system, constructed in accordance with the present invention. 

FIG. 3 is a simplified, semi-schanatic block diagram of an exemplary trellis 
decoder, including a Viterbi decoder, in accordance with the invention, suitable for 
decoding signals coded by the exemplary tiellis encoder of FIG. 6. 

FIG. 4A illustrates an exemplary PAM-5 constellation and the one-dimensional 
symbol-subset partitioning. 

FIG. 4B illustrates the eight 4D code-subsets constructed from the one-dimensional 
symbol-subset partitioning of the constellation of FIG. 4A. 

FIG. 5 illustrates the trellis diagram for the code. 

FIG, 6 is a simplified, semi-schematic block diagram of an exemplary trellis 
encoder. 

FIG. 7 is a simplified block diagram of a first exemplary embodiment of a 
structural analog of a ID slicing fimction as might be implemented in the Viterbi decoder 
ofnG.3. 

FIG. 8 is a simp l if ied block diagram of a second excmplaiy embodinieiit of a 
structural analog of a ID slicing function as might be implemented in the Viterbi decoder 

of no. 3. 

FIG. 9 is a simplified blodc diagram of a 2D error term generation machine, 
illustrating the genemtion of 2D square OTor terms fix)m the ID square error terms 
developed by the exemplary slicers of FIGs, 7 or 8. 

no. 10 is a simplified block diagram of a 4D error term generation machine, 
illustrating the gen^tion of 4D square error terms and the generation of extended path 
metrics for the 4 extended paths outgoing from state 0. 

nan is a simplified block diagram of a 4D symbol generation machine. 

FIG. 12 illustrates the selection of Ae best path incoming to state 0. 

FIG. 13 is a semi-schematic block diagram illustrating the internal arrangement of a 
portion of the path manory module of FIG. 3. 
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FTG. 14 is a block diagram illustrating the computation of the final decision and the 
tentative decisions in the path memory module based on the 4D symbols stored in the path 
memory for each state. 

FIG. IS is a detailed diagram illustrating the processing of the outputs ^ , 
^«^'^,with i=0,...,7, and Vof. V,f, of the path memory module of FIG. 3. 

FIG. 16 shows the word lengths used in one embodiment of this invention. 

FIG. 17 shows an exemplary lookup table suitable for use in computing squared 
one-dimensional error terms. 

FIGs, 18A and 18B are an exemplary look-iq) table v^ch describes the 
computation of the decisions and squared errors for both the X and Y subsets directly 
from one component of flie 4D Viterbi input of the ID slicers of FIG. 7. 

FIG. 19 illustrates the general clocking relationship between the transmitter and the 
receiver inside each of the four constituent transceivers 108 of the gigabit Ethernet 
transceiver (101 or 102) of FIG. 1; 

FIG. 20 is a simplified block diagram of an embodiment of the timing recovery 
system constructed according to the present invention; 

FIG. 21 is a block diagram of an exemplary implementation of the system of FIG. 

20; 

FIG. 22 is a block diagram of an exemplary embodiment of the phase reset logic 
block used for resetting the register of the NCO of FIG. 21 to a specified value; 

FIG. 23 is a block diagram of an exemplary phase shifter logic block used for the 
phase control of the receive clock signal R(XK; 

FIG. 24 is a flowchart of an embodiment of the process for adjusting the phase of 
the receive clock signal RCLK4 

FIG. 25A is a first example of clock distribution where the transitions of the four 
sampling clock signals ACLKO - 3 are evenly distributed within the symbol period. 

FIG. 25B is a second example of clock distribution v/hcrc the transitions of the four 
sampling clock signals ACLKO - 3 are distributed within the symbol period of 8 
nanoseconds (ns) such that each ACLK clock transition is 1 ns apart &om an adjacent 
ACLK clock transition. 

FIG. 2SC is a third example of clock distribution where the transitions of the four 
sampling clock signals ACLKO - 3 occur at the same instant widiin the symbol period. 

FIG. 26 is a flowdiart of an onbodiment of tiie process for adjusting the phase of a 
sampling clodc signal ACLKx associated with one of the constituent transceivers; 

FIG. 27 is a block diagram of an embodiment of the MSE computation blodc used 
for computing the mean squared error of a constituent transceiver. 

FIG. 28 is a simplified matrix diagram illustrating the relationship between power 
consumption and a performance metric; 
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FIG, 29A is a simplified structure diagram of an adaptive FIR filter as might be 
implemented as an echo/NEXT canceller circuit in one embodiment of a transcei ver in 
accordance with the present invention; 

FIG, 29B is an equivalent structure of the adaptive FIR filter shown in FIG. 29 A; 

FIG. 29C is a simplified structure diagram of an alternative adaptive FIR filter 
including a modification to the structure of FIG. 29B to bypass a deactivated tap; 

FIG. 29D is a simplified block diagram of a deactivate-able coeflBcieait multiplier 
circuit such as might be implemented in the filters of FIGs. 29A, 29B and 29C; 

FIG. 30 is a flowchart depicting a first exemplary embodiment of an adaptive 
power reduction method according to the present invention; 

FIG. 3 1 is a flowchart depicting one exemplaiy embodiment of an activation block 
according to the method of FIG. 30; 

FIG. 32 is a flowchart depicting one exemplary embodiment of a deactivation block 
according to the method of FIG. 30; 

FIG. 33 is a flowrchart of one embodiment of the computing block 514 of FIG. 30; 

FIG, 34 is a flowchart depicting one exemplary embodiment of a power-down 
block according to the method of FIG. 30; 

FIG. 35 is a graph of an exemplary impulse response of the echo characteristics of a 
typical channel; 

FIG. 36 is a gr^h of an exemplary impulse response of the near-end crosstalk 
(NEXT) characteristics of a typical channel; 

FIGs. 37A and 37B are graphs of the mean squared error to signal ratio 
(MSE/signal) expressed in dB as a fonction of time, with time expressed in bauds, of 
exemplary Master and Slave transceivers, respectively; 

FIGs. 38A and 38B are graphs of the values of the tap coefficients of an exemplary 
echo canceller as a fimction of the tap number, after application of the tap power 
regulating process with the specified error set at -24 dB and -26 dB, respectively; 

FIG. 39 is a block diagram of an ^emplary trellis decoder as applied to a case in 
vMch there is substantially no intersymbol interference; 

FIG. 40 is a simplified block diagram of an alternative embodiment of &e 
invention in v/bich power consumption is reduced by substitution of a symbol-by-symbol 
decoder in place of a Viterbi decoder; 

DETAILED DESCRIPTION OF THE INVENTION 

In the context of an exemplary integrated circuit-type bidirectional communication 
system, the present invention might be characterized as a system and mediod fi)r 
accommodating efficient, high speed decoding of signal samples encoded according to the 
trellis code specified in the IEEE 8023ab standard (also termed lOOOBASE-T standard). 
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As will be understood by one having skill in the art, high speed data transmission 
is often limited by the ability of decoder systems to quickly, accurately and effectively 
process a transmitted symbol wifliin a giyen tune period. In a lOOOBASE-T application 
(aptly termed gigabit) for example, the symbol decode period is typically takoi to be 
^proximately 8 nanoseconds. Pertinent to any discussion of symbol decoding is the 
realization that lOOOBASE-T systems are layered to receive 4-dmiensionaI (4D) signals 
(each signal corresponding to a respective one of four twisted pair cables) with each of the 
4-dunensional signals represented by five analog levels. Accordmgly, the decoder 
circuitry portions of transceiver demodulation blocks require a multiplicity of operational 
steps to be taken in order to effectively decode each symbol. Such a multiplicity of 
operations is computationally complex and often pushes the switching speeds of 
integrated circuit transistors which make up the computational blocks to their fimdamental 
limits. 

In accordance with the present invention, a transceiver decoder is able to 
substantially reduce the computational complexity of symbol decoding, and thus avoid 
substantial amounts of propagation delay (i,e., increase operational speed), by making use 
of truncated (or partial) representations of various quantities that make up the 
decoding/ISI compensation process. 

Sample slicing is performed in a manna: such that one-dimensional (ID) square 
error terms are developed in a representation having, at most, three bits if the terms signify 
a Euclidian distance, and one bit if the terms signify a Hamming distance. Truncated ID 
error term representation significantly reduces subsequent error processing complexify 
because of the fewer number of bits. 

Likewise, ISI compensation of s^ple signals, prior to Viterbi decodmg, is 
performed in a DFE, opcralively responsive to tentative decisions made by the Viterbi, 
Use of tentative decisions, instead of a Viterbi's final decision, reduces system latency by 
a factor directly related to the path memory sequence distance between the tentative 
decision used, and the fimal decision, i.e., if there are N steps in the path memory ftom 
input to final decision ou^ut, and latency is a fimction of N, forcing the DFE with a 
tentative decision at step N-6 causes latracy to become a fimction of N-6. A trade-oflf 
between latency reduction and accuracy may be made by choosing a tentative decision 
step either closer to the final decision point or closer to the initial pomt 

Computations associated with removmg unpauments due to intersymbol 
interference (ISI) are substantially simplified, in accordance with the present invention, by 
a combination of techniques that involves the recognition that intersymbol interference 
results from two prirnary causes, a partial response pulse shapmg filter in a transmitter and 
fiom the characteristics ofaurishielded twisted pair traiismissiondianneL During the 
initial start-up, ISI impairments are processed in indq)endent portions of electronic 
circmtry, with ISI caused by a partial response pulse shaping filter being compensated in 
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an inverse partial response filter in a feedforward equalizer (FFE) at system startup, and 
ISI caused by transmission channel characteristics compensated by a decision feedback 
equalizer (DFE) operating in conjunction with a multiple decision feedback equalizer 
(MDFE) stage to provide ISI pre-compensated signals (representing a symbol) to a 
decoder stage for symbolic decode. Performing the computations necessary for ISI 
cancellation an a bifiircated manner allows for fiast DFE convergence as well as assists a 
transceiver in achieving fest acquisition in a robust and reliable manner. After the start- 
vspj aD ISI is compensated by the combination of the DFE and MDFE. 

In order to appreciate the advantages of the present invention, it will be beneficial 
to describe the invention in the context of an exemplary bidirectional communication 
device, sudi as a gigabit Efliemet transceiver. The particular exemplary implementation 
diosen is depicted in FIG. 1, which is a sunplified block diagram of a multi-pair 
communication system operating in conformance widi the IEEE 802.3ab standard for one 
gigabit (Gb/s) Ettiemet fiill-di^lex communication over four twisted pairs of Category-5 
copper wires. 

The conununication system illustrated m FIG. 1 is represented as a point-to-point 
system, in order to sunplify the explanation, and includes two main transceiver blocks 102 
and 104, coupled together with four twisted-pair cables. Each of the wire pairs 1 12a, b, c, 
d is coupled between the transceiver blocks through a respective one of four line interfece 
dxcuits 106 and conmiunicate information developed by respective ones of four 
transmitter/receiver circuits (constituent transceivers) 108 coupled between respective 
mt^fece circuits and a physical coding sublayer (PCS) block 1 10, Four constituent 
transceivers 108 arc capable of operating simultaneously at 250 megabits per second 
(Mb/s), and are coupled through respective interface circuits to fecilitate fiiU-duplex 
bidirectional operation. Thus, one Gh/s corrmiunication throughput of each of the 
transceiver blocks 102 and 104 is achieved by using four 250 Mb/s (125 Megabaud at 2 
bits per symbol) constituent transceivers 108 for each of the transceiver blocks and four 
twisted pairs of copper cables to connect the two transceivere together. 

FIG. 2 is a simplified block diagram of the functional architecture and internal 
construction of an exemplary transceiver block, indicated generally at 200, such as 
transceiver 102 of FIG. L Since the illustrated transceiver application relates to gigabit 
Ethernet transmission, the transceiver will be referred to as the "gigabit transceiver". For 
ease of illustration and description, FIG. 2 shows only one of tiie four 250 Mh/s 
constituent transcdvers vMch are operating simultaneously (termed herein 4-D 
operation). However, since the operation of the four constituent transceivers are 
necessarily interrelated, certam blocks in the signal lines m the exemplary embodiment of 
FIG. 2 perform and cany 4-dimensional (4-D) fimctions and 4-D signals, respectively. By 
4-p, it is meant that the data fit)m the four constitu^t transceivers are used 
simultaneously. In order to clarify signal relationships in FIG. 2, thin lines conespond to 



-14- 



1-dimensional fimcticms or signals (ue., relating to only a single transceiver), and thick 
lines correspond to 4-D functions or signals (relating to all four transceivers). 

With reference to FIG. 2, Ae gigabit transceiver 200 includes a-Gigabit Medium 
Indepraident Interfiace (GMII) block 202, a Physical Coding Sublayer (PCS) block 204, a 
pulse shaping fUter 206, a digital-to-analog (D/A) converter 208, a line interfece block 
210, a highpass filter 212, a programmable gain amplifier (PGA) 214, an analog-to-digital 
(A/D) converter 216, an automatic gam control block 220, a timing recovery block 222, a 
pair-swap multiplexer block 224, a demodulator 226, an offect canceler 228, a near-end 
crosstalk (NEXT) canceler block 230 having three NEXT cancelers, and an echo canceler 
232. The gigabit transceivo- 200 also includes an A/D first-in-fiist-out buffer (FIFO) 2 1 8 
to facilitate proper transfer of data fiom the analog clock region to the receive clock 
megion, and a FIFO block 234 to feciUtate proper transfer of data fiom flie transmit clock 
region to the receive clock region. The gigabit transceiver 200 can optionally include a 
filter to cancel fiir-end crosstalk noise (FEXT canceler). 

On the transmit path, the transmit section of the GMII block 202 receives data 
fix)m a Media Access Control (MAC) module (not shown in FIG. 2) and passes the digital 
data to the transmit section 204T of die PCS block 204 via a FIFO 201 in byte-wide 
format at the rate of 125 MHz . The FIFO 201 is essentially a synchronization buffer 
device and is provided to ensure proper data transfer fixjm the MAC layer to the Physical 
Coding (PHY) layer, since the transmit clock of the PHY layer is not necessarily 
synchronized with Ae clock of the MAC layer. Hiis small FIFO 20 1 can be constructed 
widi fiom three to five memory cells to accommodate the elasticity requirement which is a 
function of fi:ame size and firequency ofi&et 

The transmit section 204T of the PCS block 204 performs scrambling and coding 
of the data and other control fimctions. Transmit section 204T of the PCS block 204 
generates four ID symbols, one for each of the four constituent transceivers. The ID 
symbol generated for the constituent transceiver depicted in FIG. 2 is filtered by a partial 
response pulse shaping filter 206 so that the radiated emission of the ou^ut of the 
transceiver m^ Ml within the EMI requirements of the Federal Communications 
Commissioa The pulse shaping filter 206 is constructed with a transfer function 0.75 
■H).252r', such tiiat thepowa- spectrum of the ou^ut of the transceiver fells below tiie 
power spectrum of a lOOBase-Tx signal. Tlie 100Base-TX is a widely used and accepted 
Fast Ethernet standard for 100 Mb/s operation on two pairs of category-5 twisted pair 
cables. Hie output of the pulse shaping filter 206 is converted to an analog signal by the 
D/A converter 208 operating at 125 MHz. The analog signal passes through the line 
interfece block 210, and is placed on the corresponding twisted pair cable for 
communication to a remote receiver. 

On the receive path, the line interfece block 210 receives an analog signal from the 
twisted pair cable. The received analog signal is preconditioned by a higlqiass filter 212 
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and a programmable gain amplifier (PGA) 214 before being converted to a digital signal 
by the A/D converter 216 operating at a sampling rate of 125 MHz. Sample timing of the 
A/D converter 216 is controlled by the ou^ut of a timing recovery block 222 controlled, 
in turn, by decision and error signals fiom a demodulator 226. The resulting digital signal 
is properly transferred fiom the analog clock region to the receive clock region by an A/D 
FIFO 2 18, an output of vAdch is also used by an automatic gain control circuit 220 to 
control the operation of the PGA 214. 

The output of the A/D FIFO 2 1 8, along with the outputs from the A/D FIFOs of 
the othor three constituent transceivers are inputted to a pair-swap multiplexer block 224. 
The pair-swap multiplexer block 224 is operatively responsive to a 4D pair-swap control 
signal, asserted by the receive section 204R of PCS block 204, to sort out the 4 ii^ut 
signals and send the correct signals to the respective demodulators of the 4 constituent 
transceivers. Since the coding scheme used for the gigabit transceivers 102, 104 (referring 
to FIG. 1) is based on the fiict that each twisted pair of wire corresponds to a ID 
constellation, and that the four twisted pairs, collectively, form a 4D constellation, for 
siymbol decoding to fimction properly, each of the four twisted pairs must be tmiquely 
identified with one of the four dimensions. Any undetected swapping of the four pairs 
would necessarily result in erroneous decoding. Although described as performed by the 
receive section 204R of PCS block 204 and the pah-swap multiplexer block 224, in the 
exemplary embodiment of FIG. 2, the pair-swapping control might alternatively be 
performed by the demodulator 226. 

Demodulator 226 receives the particular received signal 2 intended for it fiom the 
pair-swap multiplexer block 224, and fimctions to demodulate and decode the signal prior 
to directing the decoded symbols to the PCS layer 204 for transfer to the MAC. The 
dmiodulator 226 includes a multi-component feedforward equalizer (FFE) 26, having its 
ou^ut coupled to a de-skew memory circuit 36 and a trelUs decoder 38. The FFE 26 is 
multi-component m the sense that it includes a pulse shaping filter 28, a programmable 
inverse partial response (EPR) filter 30, a summing device 32, and an adaptive gain stage 
34. Functionally, the FFE 26 might be characterized as a least-mean-squares (LMS) type 
adaptive filter which performs charmel equalization as described in the following. 

Pulse shaping filter 28 is coupled to receive an input signal 2 from the pair swap 
MUX 224 and fimctions to gaierate a precursor to the input signal 2. Used for timing 
recovery, the precursor mi^t be apfly described as a zero-crossing inserted at a precursor 
position of the signal. Sudi a zero-crossing assists a timing recovery circuit in 
determining phase rclationships between signals, by giving the timing recovery circuit an 
accurately delermmable signal transition point for use as a reference. The pulse shapmg 
filter 28 can be placed any\^ere before the decoder block 38. In the exemplary 
CTobodimait of FIG. 2, tiie pulse shapmg filter 28 is positioned at the input of the FFE 26. 



-16- 



wo 00/28691 

PCT/US99/26493 

The pulse shaping filter 28 transfer function may be represented by a function of 
the form -y+z ', with y equal to 1/16 for short cables Oess than 80 meters) and 1/8 for long 
cables (more than 80 m). The determination of the length of a cable is based on the gain 
of the coarse PGA section 14 of the PGA 214. 

A progranmiable inverse partial response (IPR) filter 30 is coupled to receive the 
output of the pulse shaping filter 28, and fiinctions to compensate the ISI mtroduced by 
the partial response pulse shying in the transmitter section of the remote transceiver 
which transmitted the analog equivalent of the digital signal 2. The IPR filter 30 transfer 
fimction may be represented by a function of the form 1/(1+Kz-') and may also be 
described as dynamic. In particular, the filter's K value is dynamically varied fiom an 
initial non-zero setting, vaHd at system start-up, to a final setting. K may take any positive 
value strictly less than 1. In the illustrated embodunent, K might take on a value of about 
0.484375 during startup, and be dynamicaUy ramped down to zero after convergence of 
the decision feedback equalizer included mside the trellis decoder 38. 

The foregoing is particularly advantageous in high-speed data rccoveiy systems, 
since by compensating the transmitter induced ISI at start-iqj, prior to decoding, it reduces 
the amount of processing required by the decoder to that requned only for compensating 
transmission channel induced ISI. Hiis "bifurcated" or divided ISI compensation process 
aUows for fast acquisition in a robust and reliable manner. After DFE conveigence, noise 
enhancement in the feedforward equalizer 26 is avoided by dynamicaUy rampmg the 
feedback gam fector K of the IPR filter 30 to zero, eflfectively removing the filter fiom the 
active computational path. 

A summing device 32 subtracts fixim the ou^ut of die IPR filter 30 the signals 
received from the ofl&et canceler 228, the NEXT canceleis 230, and the echo canceler 
232. The offset canceler 228 is an adaptive filter which generates an estimate of the ofi&et 
introduced at the analog fiont end which includes the PGA 2 14 and the A/D converter 
216. Likewise, the three NEXT cancelers 230 are adaptive filters used for modeling the 
NEXT impairments in the received signal caused 1^ flie symbols sent hy the three local 
Iransmittars of flie other tiiree constituent transceivers. The unpairmraits arc due to a near- 
eaid oosstalk mechanism between the pairs of cables. Since each receiver has access to 
the data transmitted by the other three local transmitters, it is possible to neariy replicate 
the NEXT impairments through filtering. Referring to FIG. 2, the three NEXT cancelers 
230 filter the signals sent by the PCS block 204 to the other three local transmitters and 
produce three signals rq)licating the respective NEST unpaitmeats. By subtracting these 
three signals fiom the ou^ut of the IPR filter 30, the NEXT impainhents are 
{Qjproximately canceled. 

Due to flie bi-directional nature of the chaimel, each local transmitter causes an 
echo impairment on the received signal of the local receiver with whidi it is paii«d to 
form a constituent transceivo-. The echo canceler 232 is an adaptive filter used for 
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modeling the echo impairment The echo canceler 232 filters the signal sent by the PCS 
block 204 to the local transmitter associated with the receiver, and produces a teplica of 
the echo impairment By subtracting this replica signal from the output of the IPR filter 
30, Ae edio impairment is ^proximately canceled 

Following NEXT, echo and ofiEset cancellation, the signal is coupled to an adaptive 
gain stage 34 which fimctions to fine tune the gain of the signal path using a zero-forcing 
LMS algorithm. Since this adaptive gain stage 34 trains on the basis of errors of the 
adaptive offset, NEXT and echo cancellation filters 228, 230 and 232 respectively, it 
provides a more accurate signal gain than the PGA 214. 

The output of the adaptive gain stage 34, which is also the output of the FFE 26, is 
inputted to a de-skew memory 36. The de-skew memory 36 is a four-dimensional fimction 
block, i.e., it also receives tfie outputs of the three FFEs of the other three constituent 
transceivers as well as the output of FFE 26 illustrated m FIG. 2, There may be a relative 
skew in the ou^uts of the 4 FFEs, which are the 4 signal samples representing the 4 
symbols to be decoded This relative skew can be up to 50 nanoseconds, and is due to the 
variations in the way the copper wire pairs are twisted In order to correctly decode the 
four symbols, flie four signal samples must be properly aligned. The de-skew memory is 
responsive to a 4D de-skew control signal asserted by the PCS block 204 to de-skew and 
align the four signal samples received from the four FFEs. The four de-skewed signal 
samples are then directed to the trellis decoder 38 for decoding. 

Data received at the local transceiver was encoded, prior to transmission by a 
remote transceiver, using an 8-state four-dimensional trellis code. In the absence of inter- 
symbol interference (ISI), a propo* 8-state Viterbi decoder would provide optimal 
decoding of this code. However, in the case of Gigabit Ethernet, the Category-5 twisted 
pair cable introduces a significant amount of ISI. In addition, as was described above in 
connection wifli the FFE stage 26, the partial response filter of the remote transmitter on 
die other end of the communication channel also contributes a certain component of ISI. 
Hierefore, during nominal operation, flie trellis decoder 38 must decode both the trellis 
code and compensate for at least transmission charmel induced ISI, at a substantially high 
con^utational rate, corresponding to a symbol rate of about 125 Mhz. 

In the illustrated embodiment of the gigabit transceiver of FIG. 2, the trellis 
decoder 38 suitably includes an 8-state Viterbi decoder for symbol decoding, and 
incorporates circuitry vMch implements a decision-feedback sequence estimation 
^^ach in order to compensate the ISI components perturbing the signal vAdcb 
represents transmitted symbols. The 4D output 40 of the trellis decoder 38 is provided to 
tile receive section 204R of the PCS block. The receive section 204R of PCS block de- 
scrambles and fiirther decodes the symbol stream and then passes the decoded packets and 
idle stream to the receive section of the GMII block 202 for transfer to the MAC module. 
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The 4D ou^uts 42 and 44, which represent the error and tentative decision signals 
defined by the decoder, respectively, are provided to the timing recovery block 222, uiiose 
output controls the sampling time of the A/D converter 216. One of the four components 
of the OTor 42 and one of the four components of the tentative decision 44 correspond to 
the signal stream pertinent to tfie particular receiver section, illustrated in FIG. 2, and are 
provided to the adaptive gam stage 34 to adjust the gam of the signal path. 

The component 42A of the 4D error 42, vAnch corresponds to the receiver shown 
in FIG. 2, is further provided to the adaptation circuitry of each of the adaptive ofl&et, 
NEXT and echo cancellation filters 228, 230, 232. Ad^tation circuitry evaluates the 
content of the error component and, initially, adapts the filter's training process to develop 
suitable filter coefficient values. During nominal operation, adaptation circuitry monitors 
the error component and provides periodic updates to Ae filter coefficients in response 
thereto. 

As implemented in the exemplary Ethernet gigabit transceiver, the trellis decoder 
38 fimctions to decode symbols that have been encoded in accordance with the trellis code 
specified in the IEEE 802.3ab standard (lOOOBASE-T, or gigabit). As mentioned above, 
information signals are communicated between transceivers at a symbol rate of about 125 
Mhz, on each of the pairs of twisted copper cables that make up the transmission channel 
In accordance with established Ethernet communication protocols, information signals are 
modulated for transmission in accordance with a 5-level Pulse Amplitude Modulation 
(PAM-5) modulation scheme. Thus, since information signals arc represented by five 
amplitude levels, it will be understood that symbols can be expressed in a three bit 
representation on each twisted wire pair. 

Tummg now to FIGs. 4A and 4B, an exemplary PAM-5 constellation is depicted in 
FIG. 4A vAiich also dq)icts the one-dimensional symbol subset partitiorung within tiie 
constellation. As illustrated inFIG. 4A, the constellation is a representation of five 
amplitude levels, +2, +1, 0, -1, -2, in decreasing order. Symbol subset partitioning occurs 
by dividing the five levels into two ID subsets, X and Y, and assigning X and Y subset 
designations to the five levels on an alternating basis. Thus +2, 0 and -2 are assigned to 
the Y subset; +1 and -1 are assigned to the X subset The partitioning could, of course, be 
reversed, with +1 and -1 being assigned a Y designation. 

It should be recognized tiiat altiiough the X and Y subsets represent diflFaent 
absolute amplitude levels, the vector distance between neighboring amplitudes within flie 
subsets are die same, i.e., two (2). The X subset tfierefore includes amplitude level 
designations vAnch diflfer by a value of two, (- 1 , +1), as does the Y subset (-2, 0, +2). Hiis 
partitioning offers certaux advantages to slicer cucuitry m a decoder, as will be developed 
fiirther below. 

In FIG. 4B, the ID subsets have been combined into 4D subsets representing the 
four twisted pairs of the transmission charmel. Since ID subset definition is bmary (X:Y) 
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and there are four wire pairs, there are sixteen possible combmations of 4D subsets. These 
sixteen possible combinations are assigned into eight 4D subsets, sO to s7 inclusive, in 
accordance with a trellis coding scheme. Each of the 4D subsets (also termed code 
subsets) are constructed of a union of two complementary 4D sub-subsets. e.g.. code- 
subset three (identified as s3) is the union of sub-subset X:X:Y:X and its complementary 
image Y:Y:X:Y. 

Data being processed for transmission is encoded using the above described 4- 
dimensional (4D) 8-state trellis code, in an encoder circuit, such as iUustrated in the 
exemplary block diagram of FIG. 6, according to an encoding algorithm specified in the 
lOOOBASE-T standard. Referring to FIG. 6, an exemplary encoder 300. which is 
commonly provided in the transmit PCS portion of a gigabit transceiver, might be 
represented in simplified fi>nn as a convolutional encoder 302 in combination with a 
signal mapper 304. Data received by the transmit PCS fiom the MAC module via the 
transmit gigabit medium independent interfece are encoded with control data and 
scrambled, resulting in an eight bit data word represented by input bits Do through D, 
which are introduced to the signal mqiper 304 of the encoder 300 at a data rate of about 
125 MHz. The two least significant bits. Do and D,. are also inputted, in paraUel fashion, 
into a convolutional encoder 302, implemented as a linear feedback shift register, in order 
to generate a redundancy bit C which is a necessary condition for the provision of the 
coding gain of the code. ^ -- ■ 

As described above, the convolutional encoder 302 is a linear feedback shift 
register, constructed of three delay elements 303, 304 and 305 (conventionally denoted by 
z ') inten^)ersed with and separated by two summing circuits 307 and 308 which fimction 
to combine the two least significant bits (LSBs), Do and D„ of the input word with the 
output of the first and second delay elements. 303 and 304 respectively. The two time 
sequences formed by the streams of the two LSBs are convolved with the coefficients of 
the linear feedback shift register to produce the time sequence of the redundancy bit C. 
Thus, the convolutional encoder might be viewed as a state machine. 

The signal mapper 304 maps the 9 bits (Dj-D, and C) mto a particular 4- 
dimensional constellation point Each of the four dimensions uniquely corresponds to one 
of the four twisted wire pairs. In each dimension, the possible symbols are fiom the 
symbol set {-2, -1, 0, +1. +2}. The symbol set is partitioned into two disjoint symbol 
subsets X and Y. with X={-1, +1 ) and Y={-2. 0, +2}, as described above and shown in 
no. 4A. 

Referring to HG. 4B. the eight code subsets sO through s7 define the constellation 
of the code in the signal space. Each of the code subsets is formed by the union of two 
code sub-subsets, each of the code sub-subsets being formed by 4D patterns obtained fiom 
concattaiation of symbols taken fiom the symbol subsets X and Y. For example, the code 
subset sO is formed by the union of the 4D patterns fiom the 4D code sub-subsets XXXX 
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and YYYY. It should be noted that the distance between any two aibitrary even 
(respectively, odd) code-subsets . It should be further noted that each of the code 
subsets is able to define at least 72 constellation points. However, only 64 constellation 
pomts in each code subset are recognized as codewords of the trellis code specified in the 
lOOOBASE-T standard. 

This reduced constellation is termed the immed constellation. Hereinafter, tbe 
term **codewonI" is used to indicate a 4D symbol that belongs to the pruned constellation. 
A valid codeword is part of a valid path in the trellis diagram. 

Referring now to FIG. 6 and with reference to FIGs, 4A and 4B, in operation, the 
signal mapper 304 uses the 3 bits D„ Do and C to select one of the code subsets sO ^ s7, 
and uses the 6 MSB bits of Ae input signal, Dj-D^ to select one of 64 particular pomts in 
the selected code subset These 64 particular points of the selected coded subset 
correspond to codewords of the trellis code. The signal mapper 304 outputs the selected 
4D consteDation point 306 vMch will be placed on the four twisted wire pairs after pulse 
shape filtering and digital-to-analog conversion. 

no. 5 shows the trellis diagram for the trellis code specified in the lOOOBASE-T 
standard. In the trellis diagram, each vertical colunm of nodes represents tiie possible 
states that the encoder 300 (FIG. 6) can assume at a point in time. It is noted that the states 
of the encoder 300 are dictated by the states of the convolutional encoder 302 (FIG. 6). 
Since the convolutional encoder 302 has three delay elements, there are eight distinct 
states. Successive columns of nodes represent the possible states that might be defined by 
the convolutiorial encoder state liiachine at successive points in time. 

Referring to FIG. 5, the eight distinct states of ttie encoder 300 are identified by 
numerals 0 through 7, inclusive. From any given current state, each subsequent 
transmitted 4D symbol must correspond to a transition of the encoder 300 firom the given 
state to a permissible successor state. For example, fiom the current state 0 (respectively, 
fit)m current states 2, 4, 6), a transmitted 4D symbol taken fiom flie code subset sO 
corresponds to a transition to the successor state 0 (respectively, to successor states 1, 2 or 
3). Similarly, fiom current state 0, a transmitted 4D symbol taken fi-om code subset s2 
(respectively, code subsets s4, s6) corresponds to a transition to successor state 1 
(respectively, successor states 2, 3). 

Familiarity with the trellis diagram of FIG. 5, illustrates that from any even state 
(i-e., states 0, 2, 4 or 6), valid transitions can only be made to certain ones of the successor 
states, i.e., states 0, 1, 2 or 3. From any odd state (states 1, 3, 5 or 7), valid transitions can 
only be made to the remammg successor states, i.e., states 4, 5, 6 or 7. Each transition in 
the trellis diagram, also called a branch, may be tfiought of as being characterized by the 
predecessor state (the state it leaves), the successor state (the state it enters) and the 
corresponding transmitted 4D symbol A valid sequence of states is represmtcd by a path 
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through the trellis which foUows the above noted rules. A valid sequence of states 
corresponds to a valid sequence of transniittcd 4D symbols. 

At the receiving end of the communication channel, the trellis decoder 3 8 uses the 
methodology represented by the trellis diagram of FIG. 5 to decode a sequence of received 
signal samples into their symbolic representation, in accordance with the well known 
Viterbi algorithm. A traditional Viterbi decoder processes information signals iterativcly, 
on an information frame by infomiation frame basis (in the Gigabit Ethernet case, each 
information frame is a 4D received signal sample corresponding to a 4D symbol), tracing 
through a trellis diagram corresponding to the one used by the encoder, in an attempt to 
emulate the encoder's behavior. At any particular frame time, the decoder is not 
instantaneously aware of which node (or state) the encoder has reached, thus, it does not 
try to decode the node at that particular frame time. Instead, given the received sequence 
of Signal samples, the decoder calculates the most likely path to every node and determines 
the distance between each of such paths and the received sequence in order to determine a 
quantity called the path metric. 

In the next frame time, the decoder determines the most likely path to each of the 
new nodes of that frame time. To get to any one of the new nodes, a path must pass 
through one of the old nodes. Possible paths to each new node are obtained by extending 
to this new node each of the old paflis that are allowed to be thus extended, as specified by 
the trellis diagram. In the trellis diagram of FIG, 5, there are four possible paths to each 
new node. For each new node, the extended path with the smallest path metric is selected 
as the most likely path to this new node. 

By continuing the above path-extending process, the decoder determines a set of 
surviving paths to the set of nodes at the nth frame time. If all of the paths pass through 
the same node at the first frame time, then the traditional decoder knows which most likely 
node the encoder entered at the first frame time, regardless of v^diich node the encoder 
mtered at the nth fi^me time. In other words, the decoder knows how to decode tiie 
received information associated with the first frame time, even though it has not yet made 
a decision for the received information associated with the nth firame time. At the nth 
fiame time, the traditional decoder examines all surviving paths to see if they pass through 
the same first branch in the first frame time. If they do, then the valid symbol associated 
with this first brandi is outputtcd by the decoder as the decoded mformation frame for the 
first fitune time. Thra, the decoder drops the first fi:ame and takes in a new fixune for the 
XMxt iteration. Again, if all surviving paths pass through the same node of the oldest 
surviving fi:ame,dien this information frame is decoded. Hie decoder continues this 
firame-by-fiamc decoding process indefinitely so loiig as information is received. 

The number of symbols that the decoder can store is called the decoding-window 
width. The decoder must have a decoding window width large enough to ensure that a 
well-defined decision will almost always be made at a frame time. As discussed later in 
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connection with FIGs. 13 and 14, the decoding window width of the trellis decoder 38 of 
FIG, 2 is 10 symbols. This length of the decoding window is selected based on results of 
computer simulation of the trellis decoder 38. 

A decoding &ilure occurs when not all of the surviving paths to the set of nodes at 
frame time n pass through a common first branch at frame time 0. In such a case, Ae 
traditional decoder would defer making a decision and would continue tracing deeper in 
the trelUs. This would cause unacceptable latency for a high-speed system such as the 
gigabit Ethemet transceiver. Unlike the traditional decoder, the trellis decoder 38 of the 
present invention does not dieck '^^dlether the surviving paths pass through a common first 
farancL Rather, the trellis decoder, in accordance with the invention, makes an assumption 
that the surviving paths at frame time n pass through such a branch, and outputs a decision 
for frame time 0 on the basis of that assumption. If this decision is incorrect, the trellis 
decoder 38 will necessarily output a few additional mcorrect decisions based on the initial 
perturbation, but will soon recover due to the nature of the particular relationship between 
the code and the characteristics of the transmission channel. It should, further, be noted 
that this potaitial error introduction source is relatively trivial in actual practice, since the 
assumption made by the trellis decoder 38 that all the surviving paths at frame time n pass 
through a common first branch at &mn& time 0 is a correct one to a very high statistical 
probability. 

FIG. 3 is a simplified block diagram of the construction details of an exemplaiy 
trellis decoder such as described in connection with FIG. 2. The exemplary trellis decoder 
(again mdicated generally at 38) is constructed to include a multiple decision feedback 
equalizer (MDFE) 602, Viterbi decoder circuitry 604, a path metrics module 606, a path 
memory module 608, a select logic 6 1 0, and a decision feedback equalizer 612. In 
general, a Viterbi decoder is often thought of as including the path metrics module and flie 
path memory module. However, because of the unique arrangement and fimctional 
operation of the elements of the exemplary trellis decoder 38, the fimctional element 
v/ldoh performs the slicmg operation will be referred to herein as Viteibi decoder circuitry, 
a Viterbi decoder, or colloquially a Vit^bL 

The Viterbi decoder circuitry 604 performs 4D slicing of si^ials received at the 
Vitobi ii^ntts 614, and computes the brandi metrics. A branch metric, as the term is used 
herein, is well known and refers to an elemental path between neighboring Trellis nodes. 
A plurality of branch metrics will thus be understood to make up a path metric. An 
extended path metric will be understood to ref^ to a paA metric, which is extended by a 
next branch metric to thereby form an extension to the path. Based on the branch metrics 
and the previous path metrics information 618 received fiiom the path metrics module 606, 
the Vitabi decoder 604 extaids the paths and computes the extended path metrics 620 
vMch are returned to the path metrics module 606. The Viteri)i decoder 604 selects the 
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best path incoming to each of the eight states, updates the path memory stored in the path 
memory module 608 and the path metrics stored in the path metrics module 606. 

In the traditional Viterbi decoding algorithm, the inputs to a decoder are the same 
for all the states of the code. Thus, a traditional Viterbi decoder would have only one 4D 
ii^)ut for a 4D g-state code. In contrast, and in accordance with the present invention, the 
inputs 6 14 to the Viterbi decoder 604 are different for each of the eight states. This is the 
result of the feet the Viterbi inputs 614 are defined by feedback signals generated by the 
MDFE 602 and are different for each of the eight paths (one path per state) of the Viterbi 
decoder 604, as will be discussed later. 

There are eight Viterbi inputs 614 and eight Viterbi decisions 616, each 
corresponding to a respective one of the eight states of the code. Each of the eight Viterbi 
inputs 614, and each of the decision outputs 61 g, is a 4-dimensional vector whose four 
components are the Viterbi inputs and decision outputs for the four constituent 
transceivers, respectively. In other words, the four components of each of the eight Viterbi 
inputs 614 are associated wifli the four pairs of die Category-5 cable. The four 
components are a received word that corresponds to a valid codeword. From the 
foregoing, it should be understood that detection (decoding, demodulation, and the like) of 
infortnation signals in a gigabit system is inherently computationally intensive. When it is 
fiu&er realized that received information must be detected at a very high speed and in the 
presence of ISI chaimel impairments, the difiBculty in achieving robust and reliable signal 
detection will become apparent 

In accordance with the present invention, the Viterbi decoder 604 detects a non- 
binary word by first pxodiicing a set of one-dimensional (ID) decisions and a 
corresponding set of ID errors firom the 4D inputs. By combining the ID decisions with 
the ID errors, the decoder produces a set of 4D decisions and a corresponding set of 4D 
errors. Hereinafter, this generation of 4D decisions and errors from the 4D inputs is 
referred to as 4D slicing. Each of the ID errors represents the distance metric between one 
ID component of the eight 4D-inputs and a symbol in one of the two disjoint symbol- 
subsets X, Y. Each of the 4D errors is the distance between the received word and the 
corresponding 4D decision which is a codeword nearest to the received word with respect 
to one of the code-subsets si, where i=0,..7. 

4D errors may also be charactmzed as the branch metrics in the Viterbi algorithoL 
The branch metrics are added to the previous values of path metrics 618 received from the 
path metrics module 606 to form the extended path metrics 620 which are then stored in 
the path metrics module 606, replacing the previous path metrics. For any one given state 
of the eight states of the code, there are four incoming paths. For a given state, the Viterbi 
decoder 604 selects the best path, i.e., the path having the lowest metric of the four paths 
incoming to that state, and discards the other three pa&s. The best path is saved in the 
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path memory module 608. The metric associated with the best path is stored in the path 
metrics module 606, replacing the previous value of the path metric stored in that module. 

In the following, the 4D slicing function of the Viterbi decoder 604 will be 
described in detail. 4D sUdng may be described as being performed in three sequential 
steps. In a first step, a set of ID decisions and corresponding ID errors are generated from 
the 4D Viterbi mputs. Next, the ID decisions and ID errors are combmed to form a set of 
2D decisions and correspondmg 2D errors. FinaUy, the 2D decisions and 2D errors are 
combined to form 4D decisions and corresponding 4D enois. 

FIG. 7 is a simplified, conceptual block diagram of a first exemplary embodiment 
of a ID slicing fimction such as might be implemented by the Viterbi decoder 604 of FIG. 
3. Referring to FIG. 7. a ID component 702 of the eight 4D Viterbi inputs (614 of FIG. 3) 
is sliced, i.e., detected, in paiaUel feshion, by a pair of ID slicers 704 and 706 with respect 
to the X and y symbol-subsets. Each slicer 704 and 706 outputs a respective ID decision 
708 and 710 with respect to the appropriate respective symbol-subset X, Y and an 
associated squared error value 712 and 714. Each ID decision 708 or 710 is the symbol 
which is closest to the ID input 702 in the appropriate symbol-subset X and Y, 
respectively. The squared error values 712 and 714 each represent the square of the 
difference between the ID input 702 and theu^ respective ID decisions 708 and 710. 

The ID slicmg fimction shown in FIG. 7 is performed for aU four constituent 
transceivers and for all eight states of the trellis code in order to produce one pair of ID 
decisions per transceiver and per state. Thus, the Viterbi decoder 604 has a total of 32 
pairs of ID sheers disposed m a manner identical to the pair of sheers 704, 706 illustrated 
in FIG. 7. 

FIG. 8 is a sunplified block diagram of a second exemplary embodiment of 
cueuitry capable of implementing a ID sUcmg fimction suitable for incorporation m the 
Viterbi decoder 604 of FIG. 5. Referring to FIG. 8, the ID component 702 of die eight 4D 
Vitcriji inputs is shced, i.e., detected, by a first pair of ID sheers 704 and 706, witii respect 
to the X and Y symbol-subsets, and also by a 5-level sheer 805 wifli respect to the symbol 
set v*ich represents the five levels (+2, +1, 0, -1 , -2) of the constellation, i.e., a union of 
the X and Y symbol-subsets. As m the previous case desCTibed in connection widi FIG. 7, 
the sUcers 704 and 706 output ID decisions 708 and 710. The ID decision 708 is the 
symbol which is nearest the ID input 702 in the symbol-subset X while ID decision 710 
corresponds to the symbol which is nearest the ID mput 702 in the symbol-subset Y. The 
output 807 of the 5-level sheer 805 corresponds to the particular one of flie five 
constellation symbols which is determined to be closest to flie ID input 702. 

TTie diflfereaice between each decision 708 and 710 and the 5-level slicer ou^t 
807 is processed, in a manner to be described m greater detail below, to generate 
respective quasi-squared error terms 812 and 814. In contrast to the ID error tenns 712, 
714 obtained witii the first exen^laiy embodunent of a ID sheer dqiicted in FIG. 7, the 
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ID enor terms 812, 814 generated by the exemplary embodiment of FIG. 8 are more easily 
adapted to discenung relative diflFerences between a ID decision and a ID Viteibi input. 

In particular, the slicer embodiment of FIG/7 may be viewed as perfbnning a "soft 
decode", with ID error terms 712 and 714 represented by EucUdian metrics. The slicer 
embodunent depicted in FIG; 8 may be viewed as performing a "hard decode", with its 
respective ID error terms 812 and 814 expressed in Hamming metrics (i.e., 1 or 0). Thus, 
there is less ambiguity as to whether the ID Viterbi input is closer to the X symbol subset 
or to the Y symbol subset Furthermore, Hamming metrics can be expressed in a fewer 
number of bits, than EucUdian metrics, resulting in a system that is substantially less 
computationally complex and substantially faster. 

In the exemplary embodunent of FIG. 8, error terms are generated by combining 
the output of the five level slicer 805 with the outputs of the ID slicers 704 and 706 in 
respective adder circuits 809A and 809B. The outputs of the adders are directed to 
respective squared magnitude blocks 81 1 A and 81 IB which generate the binary squared 
error terms 812 and 814, respectively. 

Implementation of squared error terms by use of circuit elements such as adders 
809A, 809B and the magnitude squared blocks 81 lA, 81 IB is done for descriptive 
convenience and conceptual illustration purposes only. In practice, squared error term 
definition is implemented with a look-iq) table that contains possible values for enor-X 
and error-Y for a given set of dedsion-X, decision- Y and Viterbi input values. The look- 
up table can be implemented with a read-only-memory device or alternatively, a random 
logic device or PLA. Examples of look-up tables, suitable for use in practice of the 
present invention, are illustrated in FIGs. 17, 18A and 18B. 

Hie ID slicing fimction exemplified in FIG. 8 is performed for all four constituent 
transceivers and for all eight states of the trellis code in order to produce one pair of ID 
decisions per transceiver and per state. Thus, the Viterbi decoder 604 has a total of fliirty 
two pairs of ID slicers that correspond to the pair of slicers 704, 706, and thirty two 5- 
level slicers Oat correspond to the S-level slicer 805 of FIG. 8. 

Each of the ID errors is represented by substantially fewer bits than each ID 
component of the 4D inputs. For example, in the embodiment of FIG. 7, the ID 
component of the 4D Viterbi input is rqjresented by 5 bits, wfaUe the ID error is 
represented by 2 or 3 bits. TraditionaUy, proper soft decision decoding of such a treUis 
code would require that the distance metric (EucUdean distance) be represented by ^ to 8 
bits. One advantageous feature of the present mvention is that only 2 or 3 bits are required 
for the distance metric in soft decision decoding of this trellis code. 

In the embodiment of FIG. 8, the ID error can be represented by just 1 bit It is 
noted that since the ID error is represented by 1 bit the distance metric used in tins treUis 
decoding is no longa- the Euclidean distance, which is usually associated witii trellis 
decoding, but is instead Ae Hamming distance, which is usually associated with hard 
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decision decoding of binary codewords. This is another particularly advantageous feature 
of the present invention. 

FIG. 9 is a block diagram illustrating the generation of the 2D errors ftom the ID 
errors for twisted pairs A and B (corresponding to constituent transceivers A and B). 
Since the generation of errors is similar for twisted pairs C and D, this discussion will only 
concern itself with the A:B 2D case. It will be understood that the discussion is equally 
applicable to the C:D 2D case with the appropriate change in notation. Referring to FIG. 
9, ID error signals 712A, 712B, 714A, 714B might be produced by the exemplary ID 
slicing functipnal blocks shown in FIGs. 7 or 8. The ID error term signal 712A (or 
respectively. 712B) is obtained by slicing, with respect to symbol-subset )^ the 1 D 
component of the 4D Viterbi input, vMch corresponds to pair A (or respectively, pair B). 
The ID error term 714A (respectively, 714B) is obtained by slicing, with respect to 
symbol-subset Y, the ID component of the 4D Viterbi mput, wiiich corresponds to pair A 
(respectively, B). The ID errors 712A, 712B, 714A, 714B are added according to aU 
possible combinations (XX, XY, YX and YY) to produce 2D error terms 902 AB, 904 AB, 
906AB, 908AB for pairs A and B. Similarly, the ID enx)rs 712C, 712D, 714C, 714D (not 
shown) are added according to the four diflfercnt symbol-sub^t combinations XX, XY, 
YX and YY) to produce corresponding 2D eiror terms for wire pairs C and D. 

FIG. 10 is a block diagram illustrating Ae generation of the 4D errors and extended 
path metrics fiar the four extended paths outgoing firpm state 0. Referring to FIG. 10, the 
2D errors 902AB, 902CD, 904AB, 904CD, 906AB, 906CD, 908 AB, 908CD are added in 
pairs according to eight different combinations to produce eight intermediate 4D errors 
1002, 1004, 1006, 1008, 1010, 1012, 1014, 1016. For example, the 2D error 902AB, 
which is the squared error with respect to XX fiom paiis A and B, are added to the 2D 
error 902CD, v^iiich is the squared error with respect to XX fiom pairs C and D, to form 
the mtermediate 4D error 1002 \^ch is the squared error with r^pect to sub-subset 
XXXX for pairs A, B, C and D. Similarly, the intermediate 4D error 1004 which 
corresponds to the squared error with respect to sub-subset YYYY is formed from the 2D 
errors 908AB and 908CD. 

The eight intermediate 4D errors arc grouped in pairs to correspond to the code 
subsets sO, s2, s4 and s6 represented in FIG. 4B. For example, the intemiediate 4D errors 
1002 and 1004 are grouped together to correspond to die code subset sO vAnch is formed 
by the union of the XXXX and YYYY sub-subsets. From each pair of intennediate 4D 
ators, the one with the lowest value is selected (the other one being discarded) in order to 
provide the braiKh metric of a transition in the trellis diagram fiom state 0 to a subsequent 
state. It is noted that, according to the trellis diagram, transitions fiom an even state (i.e., 
0, 2, 4 and 6) are only allowed to be to the states 0, 1,2 and 3, and transitions from an odd 
state (i.e., 1, 3, 5 and 7) are only allowed to be to the states 4, 5, 6 and 7. Each of the 
index signals 1026, 1028, 1030, 1032 indicates which of the 2 sub-subsets the selected 
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intennediate 4D error corresponds to. The branch metrics 1018, 1020, 1022, 1024 are the 
, branch metrics for the transitions in the trellis diagram of FIG. 5 associated with code- 
subsets sO, s2, s4 and s6 respectively, from state 0 to states 0, 1 , 2 and 3, respectively. Hie 
branch metrics are added to the previous path metric 1 000 for state 0 in order to produce 
the extended path metrics 1034, 1036, 1038, 1040 of the four extended paths outgoing 
from state 0 to states 0, 1, 2 and 3, respectively. 

Associated with the eight intennediate 4D errors 1002, 1004, 1006, 1008, 1010, 
1012, 1014, 1016 are the 4D decisions vMch are formed from the ID decisions made by 
one of the exemplary slicer embodhnents of FIG. 7 or 8. Associated with the branch 
metrics 1018, 1020, 1022, 1024 are the 4D symbols derived by selecting the 4D decisions 
using the index ou^uts 1026, 1028, 1030, 1032. 

FIG. 1 1 shows the generation of the 4D symbols associated with the branch metrics 
1018, 1020, 1022, 1024. Referring to FIG. 1 1 , the ID decisions 708A, 708B, 708C. 708D 
are the ID decisions with respect to symbol-subset X (as shown in FIG. 7) for constituent 
transceivers A, B, C, D, respectively, and the ID decisions 710A, 710, 710C, 710D are the 
ID decisions with respect to symbol-subset Y for constituent transceivers A, B, C and D, 
respectively. The ID decisions are concatenated according to the combinations which 
correspond to a left qr right hand portion of the code subsets sO, s2, s4 and s6, as depicted 
in FIG. 4B. For example, flie ID decisions 708A, 708B, 708C, 708D are concatenated to 
correspond to tfie left hand portion, XXXX, of the code subset sO. The 4D decisions are 
grotq)ed in pairs to correspond to the union of symbol-subset portions making up the code 
subsets sO, s2, s4 and s6. In particidar, the 4D decisions are grouped together to 
correspond to the code subset sO which is formed by the union of the XXXX and YYYY 
subset portions. 

Referring to FIG. 1 1, the pairs of 4D decisions are inputted to the multiplexers 
1120, 1122, 1124, 1126 which receive the index signals 1026, 1028, 1030, 1032 (FIG. 10) 
as select signals. Each of the multiplexers selects from a pair of the 4D decisions, the 4D 
dedsion which corresponds to the sub-subset indicated by the corresponding index signal 
and oulputs the selected 4P decision as the 4D symbol for the branch whose branch metric 
is associated with the index signal. The 4D symbols 1 130, 1 132, 1 134, 1 136 correspond 
to the transitions in the trellis diagram of FIG. 5 associated with code-subsets sO, s2, s4 
and s6 respectively, fiom stale 0 to states 0, 1,2 and 3, respectively. Eachofthe4D 
symbols 1 130, 1 132, 1 134, 1136 is the codeword in the corresponding code-subset (sO, s2, 
s4 and s6) ^ch is closest to the 4D Viterbi input for state 0 (there is a 4D Viterbi input 
for each state). The associated branch metric (FIG. 10) is the 4D squared distance between 
the codeword and the 4D Viterbi input for state 0. 

FIG. 12 illustrates the selection of the best path incoming to state 0. The extended 
path metrics of the four paths incoming to state 0 from states 0, 2, 4 and 6 are inputted to 
the comparator module 1202 \^ch selects the best path, i.e., the path with the lowest path 
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metric, and outputs the Path 0 Select signal 1206 as an indicator of this path selection, and 
the associated path metric 1204. 

The procedure described above for processing a 4D Viterbi input for state 0 of the 
5 code to obtain four branch metrics, four extended path metrics, and four corresponding 4D 

symbols is similar for the other states. For each of the other states, the selection of the best 
path ftom the four incoming paths to that state is also similar to the procedure described in 
connection with FIG. 12. 

The above discussion of the computation of the branch metrics, illustrated by FIG. 

2Q 7 through 1 1 , is an exemplary plication of the method for slicing (detecting) a received 
L-dimensional word and for computing the distance of the received L-dimensional word 
fiom a codeword, for the particular case where L is equal to 4. 

In general terms, i.e., for any value of L greater than 2, the method can be 
described as follows. The codewords of the trellis code are constellation points chosen 

j5 from 2^^ code-subsets. A codeword is a concatenation of L symbols selected fiom two 
disjoint symbol-subsets and is a constellation point bclongmg to one of the 2^^ code- 
subsets. At tfie receiver, L inputs are received, each of the L mputs uniquely 
conespondmg to one of the L dunensions. The received word is formed by the L inputs. 
To detect the received word, 2^' identical input sets are formed by assigning the same L 

20 inputs to each of flie 2*"' input sets, Eadi of the L inputs of each of the 2^'' mput sets is 

sliced with respect to each of the two disjomt symbol-subsets to produce an error set of 2L 
one-dimensional errors for each of the 2^*' code-subsets. For the particular case of the 
trellis code of tiie type described by the trellis diagram of FIG. 5, the one-dimensipnal 
errors are combmed wifltin each of tiie 2^' error sets to produce 2^^ L-dimensional errors 

25 for the corresponding code-subset such tiiat each of the 2*-"^ L-dimensional errors is a 

distance between the received word and one of the codewords in the corresponding code- 
subset 

One embodiment of this combining operation can be described as follows. First, 
the 2L one-dimensioruil errors are combined to produce 2L two-dimensional errors (FIG. 

30 T^en, the 2L two-dimerisiorial enors are combined to pn^duce 2^ mtermediate L- 

dunensional mors vMoh are arranged into 2^^ pairs of errors such that these pairs of 
errors correspond one-to-one to the 2^^ code-subsets (FIG. 10, signals 1002 through 1016). 
A minim u m is selected fi>r eadi of the 2^' pairs of errors (FIG. 10, sigrials 1026, 1028, 
1030, 1032). These minima are the 2^' L-dimensional errors. Due to the constraints on 

35 transitions from one state to a successor state, as shown m the trellis diagram of FIG. 5, 
only half of the 2^^ L-dimeasional errors correspond to allowed transitions in the trellis 
diagram. These 2^^ L-dimmsional errors are associated witii 2^^ L-dim^^ 
Eadi of the 2" L-dimensional decisions is a codeword closest in distance to the received 
word (the distance bemg rq)rcsented by one of the 2^^ L-dimensional errors), the 
codeword being in one of half of the 2^^ code-subsets, Le., in one of 2^^ code-subsete of 
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the Z"-' code-subsets (due to the particular constraint of the trellis code described by the 
trellis diagram of nO. 5). 

It is important to note that the details of the combining operation on the 2L one- 
dimensional errors to produce the final L-dimensional errors and the number of the final 
L-dimensional errors are functions of a particular trellis code. In other words, they vary 
depending on the particular trelUs code. 

HG. 13 iUustrates the construction of the path memory module 608 as 
implemented in the embodiment of FIG.6. The path memory module 608 includes a path 
memory for each of the eight paths. In Ae iUustrated embodiment of the invention, the 
path memory for each path is implemented as a register stack, ten levels in depth. At each 
level, a 4D symbol is stored in a register. The number of path memoiy levels is chosen as 
a tradeoff between receiver latency and detection accuracy. FIG. 13 only shows the path 
memory for path 0 and continues with the example discussed in FIGs. 7-12. FIG. 13 
illustrates how the 4D decision for the path 0 is stored in the path memory module 608 
and how the Path 0 Select signal, i.e., the information about which one of die four 
incoming extended paths to state 0 was selected, is used m the corresponding path memory 
to force merging of the paths at aU depth levels (levek 0 through 9) in the path memory. 

Referring to FIG. 13, each of the ten levels of the path memoiy includes a 4^to.l 
multiplexer(4:l MUX)andaregistertostorea4Ddecision. The registers are numbered 
according to then- dqith levels. For example, register 0 is at depth level 0. The Path 0 
Select signal 1206 (FIG. 12) is used as die select input for the 4:1 MUXes 1302, 1304, 
1306,.... 1320. The 4D decisions 1130, 1132, 1134, 1136 (FIG. 11) are inputt^tothe 
4:1 MUX 1302 which selects one of the four4D decisions based on the Path 0 select 
signal 1206 and stores it in the register 0 of path 0. One symbol period later, the register 0 
of path 0 ou^uts the selected 4D decision to the 4: 1 MUX 1 304. The other three 4D 
decisions inputted to the 4:1 MUX 1304 are fiom the registers 0 of paths 2, 4, and 6. 
Based on the Path 0 Select signal 1206, the 4:1 MUX 1304 selects one of the four 4D 
decisions and stores it in die registerl of path 0. One symbol period later, the register 1 of 
path 0 outputs the selected 4D decision to the 4:1 MUX 1306. The other three 4D 
decisions inputted to the 4:1 MUX 1306 are from the registers 1 of paths 2. 4, and 6. 
Based on the Path 0 Select signal 1206, the 4:1 MUX 1306 selects one of the four 4D 
decisions and stores it in the register 2 of path 0. This procedure continues for levels 3 
through 9 of the path memory for path 0. During continuous operation, ten 4D symbols 
representing path 0 are stored in registers 0 through 9 of the path memory for path 0. 

Similariy to path 0, each of the paths 1 though 7 is stored as ten 4D symbols in the 
registers of the corresponding path memoiy. The connections between the MUX of one 
path and registers of different paths follows the treUis diagram of HG. 2. For example, the 
MUX at level k for path 1 receives as inputs the outputs of the registers at level k-1 for 
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I^ths 1,3,5, 7, and the MUX at level k for path 2 receives as inputs the outputs of the 
registers at level k-1 for paths 0, 2, 4, 6. 

FIG, 14 is a block diagram illustrating the computation of the final decision and the 
tentative decisions in the path memory module 608 based on the 4D symbols stored in the 
paih memory for each state. At each iteration of the Viterbi algorithm, the best of Ae 
eigjit states, Le., the one associated vwth the path having the lowest path metric, is selected, 
and the 4D symbol firom the associated path stored at the last level of the path memory is 
selected as the final decision 40 (FIG. 3). Symbols at lower depth levels are selected as 
tentative decisions, vMch are used to feed the delay line of the DFE 612 (FIG. 3). 

Referring to FIG. 14, the path metrics 1402 of the eight states, obtained from the 
procedure of FIG. 12, are inputted to tiie coniparator module 1406 vrfiich selects the one 
with the lowest value and provides an indicator 1401 of this selection to the select inputs 
of the 8-to-l multiplexers (8:1 MUXes) 1402, 1404, 1406, 1420, which are located at 
path memory dq>th levels 0 through 9, respectively. Each of the 8:1 MUXes receives 
eight 4D symbols ou^utted from corresponding registers for the eight paths, the 
corresponding registers being located at the same depth level as the MUX, and selects one 
of the eight 4D symbols to ou^ut, based on the select signal 1401. The ou^uts of the 8:1 
MUXes located at depth levels 0 through 9 are Vq, V„ Vj, V,, respectively. 

In the illustrated embodiment, one set of eight signals, output by the first register 
set (the register 0 set) to the first MUX 1402, is also taken ofiF as a set of eight outputs, 
denoted Vo' and provided to the MDFE (602 of FIG. 3) as a select signal vsiiich is used in a 
manner to be described below. Although only the first register set is illustrated as 
providing outputs to the DFE, the invention contemplates the second, or even higher order, 
register sets also providing similar outputs. In caga^^ere multiple register sets provide 
ou^uts, these are identified by the register set depth order as a subscript, as in VjV and the 
like. 

In the illustrated embodiment, the MUX ou^uts Vq, V„ V2 are delayed by one unit 
of time, and are thra provided as the tentative decisions Y^, V,p, V^f to the DFE 612. The 
nuniber of the ou^uts Vj to be used as tentative decisions depends on the required 
accuracy and speed of decoding operation. After fiirther delay, the output of the first 
MUX 1402 is also provided as the 4D tentative decision 44 (FIG. 2) to the Feedforwaid 
Equalizers 26 of the four constituent transceivers and tiie timmg recovery block 222 (FIG. 
2). The 4D symbol V^^, which is the output of tiie 8: 1 MUX 1420 delayed by one time 
xmit, is provided as the final decision 40 to the receive section of the PCS 204R (FIG. 2). 

The following is the discussion on how outputs Vo', V/, Vqf, V^p, of the pafli 
memory module 608 might be used in the select logic 610, the MDFE 602, and tiie DFE 
612 (FIG. 3). 

FIG. 15 is a blodc: level diagram of the ISI compensation portion of the decoder, 
including construction and operational details of the DFE and MDFE circuitry (612 and 
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602 of FIG. 3, respectively). The ISI compensation embodiment dqjicted in FIG. 15 is 
adapted to receive signal samples from the deskew memoiy (36 of FIG. 2) and provide ISI 
compensated signal samples to the Viterbi (sUcer) for decoding. The embodiment 
iUustrated in FIG. 15 includes the Viterbi block 1502 (which includes the Viterbi decoder 
604, Ae path metrics module 606 and the path inemoiy module 608), the select logic 610, 
the MDFE 602 and the DFE 612. 

The MDFE 602 computes an independent feedback signal for each of the paths 
stored in the path memoiy module 608. These feedback signak represent different 
hypotheses for the intersymbol inteafoence component piesemt in ttic input 37 (FIGs. 2 
and 6) to the trellis decoder 38. The diflferent hypotheses for the intereymbol interference 
component correspond to the difiGsient hypotheses about the previous symbols which are 
rqiresented by the different paths of the Viterbi decoder. 

The Viterbi algorithm tests these hypotheses and identifies the most likely one. It 
is an essential aspect of tiie Viterbi algorithm to postpone this identifying decision until 
there is enough information to minimize tiie probabiUty of error in the decision. In the 
meantime, all the possibilities are kept open. Ideally, the MDFE block would use the 
entire pafli memory to compute the different feedback signals using the entire lengtii of the 
path memory. In practice, this is not possible because this would lead to unacceptable 
complexity. By "unacceptable", it is meant requiring a very large number of components 
and an extremely complex interconnection pattern. 

TTierefore, in the exemplary embodiment, the part of the feedback signal 
computation tiiat is performed on a per-patfa basis is limited to tiie two most recent 
symbols stored in register set 0 and register set 1 of all paths in the patii memory module 
608, namely Vo' and V,* with i=0,...,7, indicating tiie patii. For symbols older tiian two 
periods, a hard decision is forced, and only one replica of a "tail" component of the 
intersymbol interference is computed. This results in some marginal loss of performance, 
but is more tiian adequately compensated for by a simpler system implementation. 

The DFE 612 computes this "imi" component of the intersymbol interference, 
based on the tentative decisions Vop, Vu^, and V„. The reason for usmg three different 
tentative decisions is that the reliability of the decisions mcreases wifli flie increasing 
deptii into flie pafli memory. For example, V,f is a more reliable version of Vop delayed by 
one symbol period. In tiie absence of errors, V,f would be always equal to a delayed 
version of Vop. In the presence of errors. V, f is different from Vop, and flie probabiUty of 
V,F being in error is lower flian flie probability of V^ being in error. Sirnilarly, Vjp is a 
more reliable delayed vecaon of V,p. 

Referring to FIG. 15, flie DFE 612 is a filter having 33 coefficients Co fluough 
corresponding to 33 taps and a delay line 1504. The delay line is constructed of 
sequentially disposed summing junctions and delay elements, sudi as registers, as is well 
understood in flie sat of filter design. In flie illustrated embodiment, the coefficients of the 
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DFE 612 are updated once eveiy four symbol periods, i.e., 32 nanoseconds, in well known 
fiashion, using the well known Least Mean Sqiiares algorithm, based on a decision input 
1505 from the Vitcrbi block and an OTor input 42dfe, 

The symbols V^p, V,f , and Vjp are "jammed", meaning inputted at various 
locations, into the delay line 1504 of the DFE 612. Based on these symbols, the DFE 612 
produces an intersymbol interference (ISI) replica portion associated with all previous 
symbols except the two most recent (since it was derived without using the first two taps 
of the DFE 612). The ISI repUca portion is subtracted from the output 37 of the deskew 
memory block 36 to produce the signal 1508 v^4iich is then fed to the MDFE block. The 
signal 1508 is denoted as the "tail" component in FIG. 3. In the illustrated embodiment, 
the DFE 612 has 33 taps, numbered from 0 through 32, and the tail component 1508 is 
associated with taps 2 through 32. As shown in FIG. 1 5, due to a circuit layout reason, the 
tail component 1508 is obtained in two steps. First, flie ISI replica associated with taps 3 
through 32 is subtracted from the deskew memory output 37 to produce an intermediate 
signal 1507. Then, the ISI replica associated with the tap 2 is subtracted from the 
intermediate signal 1507 to produce the tail component 1508. 

The DFE 612 also computes the ISI replica 15i0 associated with the two most 
recent symbols, based on tentative decisions Vqf, Vjp, and Y^f- This ISI replica 1510 is 
subtracted from a delayed vision of the output 37 of the deskew memory block 36 to 
provide a soft decision 43. The tratative decision Vojr is subtracted from the soft decision 
43 in order to provide an error signal 42. Error signal 42 is fiirther processed into several 
additional representations, identified as 42enc, 42ph and 42dfe. The error 42enc is 
provided to the echo cancelers and NEXT cancelers of the constituent transceivers. The 
error 42ph is provided to the FFEs 26 (FIG. 2) of the four constituent transceivers and the 
timing recovery block 222. The error 42dfe is directed to the DFE 612, where it is used 
for the adaptive updating of the coeflSdents of the DFE togetho- with the last tentative 
decision from the Viterbi block 1502. The tentative decision 44 shown in FIG. 3 is a 
delayed va:sion of Vqf. The soft decision 43 is ou^utted to a test inter&ce for display 
purposes. 

The DFE 612 provides the tail component 1508 and the values of the two "initial" 
coefficients Qj and C, to the MDFE 602. The MDFE 602 computes eight different 
replicas of die ISI associated with the first two coefGcients of the DFE 612. Each of these 
ISI replicas corresponds to a different path in die path memory module 608. This 
computation is part of the so-called "critical path" of the trellis decoder 38, in other words, 
the sequence of computations that must be completed in a single symbol period. At the 
speed of operation of the Gigabit Ethonet transceivers, the symbol period is 8 
nanoseconds. All the challenging computations for 4D slicing, branch metrics, path 
extensions, selection of best path, and update of path memory must be completed within 
one symbol period. In addition, before these compiitations can even begin, &e MDFE 602 
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must have completed the computation of the eight 4D Viterbi inputs 614 (FIG. 3) which 
involves computing the ISI repUcas and subtracting them fiom the ou^ut 37 of the de- 
skew memory block 36 (FIG. 2). This bottleneck in the computations is veiy difficult to 
resolve. The system of the present mvention allows the computations to be carried out 
smoothly in tile allocated time. 

Referring to FIG. 15. the MDFE 602 provides ISI compensation to received signal 
samples, provided by the deskew memory (37 of FIG. 2) before providing them, in turn, to 
the input of the Viterbi block 1502. ISI compensation is performed by subtracting a 
multipUcity of derived ISI rcpUca components ftom a received signal sample so as to 
develop a multipUcity of signals that, together, represents various expressioDs of ISI 
compensation that might be associated with any arbitrary symbol. One of the ISI 
compensated arbitrary symbolic representations is then chosen, based on two tentative 
decisions made by the Viterbi block, as the iiiput signal sample to the Viterbi. 

Since the symbols under consideration belong to a PAM-5 alphabet, they can be 
expressed in one of only 5 possible values (-2, -1. 0, +1, +2). Representations of these five 
values are stored m a convolution engine 151 1. where they are convolved with the values 
of the first two filter coefficients Q and C, of the DFE 612. Because there are two 
coefficient values and five level representations, the convolution engme 1511 necessarily 
gives a twenty five value result that might be expressed as (ajQ + bjC,), with Co and C, 
representing the coefficients, and with a, and bj representing the level expressions (with 
1=1,2.3,4.5 and j=1^3,4,5 ranging independently). 

These twenty five values are negatively combined with the tail component 1508 
received from the DFE 612. The taU component 1508 is a signal sample from which a 
partial ISI component associated with taps 2 through 32 of the DFE 61 2 has been 
subtracted. In eflfect. the MDFE 602 is operating on a partiaUy ISI compensated (pre- 
oompensated) signal sample. Each of the twenty five pre^x)mputed values is subtracted 
from the partiaUy compensated signal sample in a respective one of a stack of twenty five 
summing junctions. The MDFE flien saturates the twenty five results to make them fit in a 
predetermined range. This saturation process is done to reduce the number of bits of each 
of the ID components of the Viterbi input 614 in order to fecUitate lookup table 
computations of branch metrics. Tlie MDFE 602 then stores the resultant ISI compensated 
signal samples m a stack of twenty five registers, which makes the samples avaUable to a 
25:1 MUX for mput sample selection. One of the contents of the twenty five registers wiU 
correspond to a component of a 4D Viterbi input with the ISI correctly cancelled, provided 
that there was no decision error (meaning the hard decision regarding the best path forced 
^xpon taps 2 through 32 of the DFE 612) in the computation of the tail component In the 
abseiice of noise, this particular value wUl coincide with one of the ideal 5-level symbol 
values (i.e., -2, -1, 0. 1, 2). In practice, thae will always be noise, so this value wiU be in 
general different than any of the ideal symbol values. 
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This ISI compensation scheme can be ejqjanded to accommodate any nimiber of 
symbolic levels. If signal processing were perfonned on PAM-7 signals, for example, the 
convolution aigine 151 1 would ou^ut forty nine values, i.e., a, and bj would range fiom 1 
to 7. Enor rate could be reduced, i.e., poformance could be improved, at the expense of 
greater system complexity, by increasing the number of DFE coefficients inputted to the 
convolution engine 1511, The reason for this improvement is that the foix^ed hard decision 
(regarding the best path forced upon taps 2 through 32 of the DFE 6 12) that goes into the 
"tail" computation is delayed. If C2 were added to the process, and the symbols are again 
ejqjressed in a PAM-5 alphabet, the convolution engine 151 1 would output one hundred 
twenty five (125) values. Error rate b reduced by decreasing the tail component 
computation, but at the expense of now requiring 125 summing junctions and registers, 
anda 125:1 MUX. 

It is important to note that, as inputs to the DFE 612, the tentative decisions Vop, 
Vu„ are time sequences, and not just instantaneous isolated symbols. If there is no 
error in the tentative decision sequence Vqf. then the time sequence Vjf will be the same 
as flie time sequence Y„ delayed by one time unit, and the same as the time sequence Vqf 
delayed by two time units. However, due to occasional decision error in the time sequence 
VoF, v/tdch may have been corrected by the more reliable time sequence V,f or Vir, time 
sequences Vjp and Vjp may not exactly correspond to time-shifted versions of time 
sequence Y^^. For this reason, instead of using just one sequence Vop, all three sequences 
VoF, V,p and Vj^ are used as inputs to the DFE 612. Although this implementation is 
essentiaUy equivalent to convolving Vof with all the DFE's coefficients when there is no 
decision error in Vop, it has the added advantage of reducing the probabiKty of introducing 
a decision error into the DFE 612. It is noted that other tentative decision sequences along 
flie depth of the path memory 608 may be used instead of the sequences Vof, V,f and Yj^. 

Tentative decisions, developed by the Viterbi, are taken fiom selected locations in 
the path memory 608 and "jammed" into the DFE 612 at various locations along its 
computational path. In the illustrated embodiment (FIG. 15), the tentative decision 
sequence is convolved with the DFE's coefficients Co through C3, the sequence V,f is 
convolved with the DFE's coefficients C4 and C,, and the sequence Y„ is convolved with 
the DFE's coefficients Q through C32. It is noted that, since the partial ISI component that 
is subtracted fiwm the deskew memory output 37 to form tihe signal 1508 is essentially 
taken (in two steps as described above) fiom tap 2 of the DFE 612, this partial ISI 
component is associated with the DFE's coefficients Cj through Cjj. It is also noted that, 
in another embodiment, instead of using the two-step computation, this partial ISI 
component can be directly taken fiom the DFE 612 at point 1515 and subtracted fiom 
signal 37 to form signal 1 508. 

It is noted that the sequences Vop, V,f, V^f correspond to a hard decision regarxiing 
the choice of the best path among the eigjit paths (patb. i is flie path ending at state i). 
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Thus, the partial ISI component associated with the DFE's coefficients through Cj, is 
Ae result of forcing a hard decision on the group of higher ordered coefficients of the DFE 
612. ITie underlying reason for computing only one partial ISI signal instead of eight 
complete ISI signals for the eight states (as done conventionaUy) is to save in 
computational complexity and to avoid timing problems. In effect, the combination of the 
DFE and the MDFE of the present invention can be thought of as performing the functions 
of a group of eight different conventional DFEs having the same tap coefficients except 
for the first two tap coefficients. 

For each state, there remains to determine which path to use for die remaining two 
coefficients in a very short interval of time (about 16 nanoseconds). This is done by the 
use of the convolution engine 15 II add the MDFE 602. It is noted that the convolution 
engine 1511 can be implemented as an integral part of the MDFE 602. It is also noted 
that, for each constituent transceiver, i.e., for each ID component of the Viterbi input 614 
(the Viterbi input 614 is practically eight 4D Viterbi inputs), there is only one convolution 
engine 151 1 for all the eight states but there are eight repUcas of the select logic 610 and 
eight replicas of the MUX 1512. 

The convolution engine 151 1 computes all the possible values for the ISI 
associated with the coefficients Q and C,. There arc only twenty five possible values, 
since this ISI is a convolution of these two coefficients with a decision sequence of length 
2, and each decision in the sequence can only have five values (-2, -1, 0, +1, +2). Only 
one of tiiese twenty five values is a correct value for this ISI. These twenty five 
IqiJOtheses of ISI are dien provided to the MDFE 602. 

In the MDFE 602, the twenty five possible values of ISI are subtracted bom the 
partial ISI compensated signal 1508 using a set of adders connected in parallel. The 
resulting signals are then saturated to fit in a predetermined range, using a set of saturators 
The saturated results are then stored in a set of twenty five registers. Provided that there 
was no decision error regarding the best path (among the eight paths) forced upon taps 2 
through 32 of the DFE 612, one of the twenty five registers would contain one ID 
component of the Viterbi input 614 with the ISI correctiy cancelled for one of the eight 
states. 

For each of the eight stales, the generation of the Viterbi input is limited to 
selecting the correct value out of these 25 possible values. This is done, for each of the 
eight states, using a 25-to-l multiplexer 1512 whose select input is the output of the select 
logic 610. The select logic 610 receives and (i=0....,7) for a particular state i fiom 
the path memory module 608 of the Viterbi block 1502. The select logic 610 uses a pre- 
computed tookup table to determine the value of the select signal 622A based on the 
values of » and fi)r the particular state i. TTie select signal 622A is one component 
of the 8-component select signal 622 shown in FIG. 3. Based on the select signal 622A, 
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the 25-to-l multiplexer 1512 selects one of the contents of the twenty five registers as a 
ID component of the Viterbi input 614 for the corresponding state i. 

no. 15 only shows the select logic and the 25.to-l multiplexer for one state and 
for one constituent transceiver. There arc identical select logics and 25-to-l multiplexers 
for the eight states and for each constituent transceiver. In other words, the computation of 
the 25 values is done only once for aU the eight states, but the 25:1 MUX and the select 
logic are lephcated eight times, one for each state. The input 614 to the Viterbi decoder 
604 is, as a practical matter, eight 4D Viterbi mputs. 

In the case of the DFE, however, only a single DFE is contemplated for practice of 
the mventioa In contrast to alternative systems where eight DFEs arc required, one for 
each of the eight states imposed by the treUis encoding scheme, a single DFE is'sufficient 
smce the decision as to which path among the eight is the probable best was mSde in the 
Viterbi block and forced to the DFE as a tentative decision. State status is maintamed at 
the Viteibi decoder input by controlling the MDFEoutput with the state specific signals 
developed by the 8 select logics (610 of FIG. 3) m response to the eight state specific 
signals Vo' and V,'. i=0....,7, fiom the padi memoiy module (608 of FIG. 3). Although 
identified as a singular DFE. it will be understood that the 4D architectural requirements 
of the system means that the DFE is also 4D. Each of the four dimensions (twisted pairs) 
wiU exhibit thek own independent contributions to ISI and these should be dealt with 
accoidmgly. Thus, the DFE is singular, with respect to state architechire, when its 4D 
nature is taken into account 

In the architecture of the system of the present invention, the Viterbi input 
computation becomes a very smaU part of the critical path since the multiplexers have 
extremely low delay due largely to the placement of the 25 registers between the 25:1 
multiplexer and the saturators. If a register is placed at the input to the MDFE 602. then 
the 25 registers would not be needed. However, this would cause the Viterbi input' 
computation to be a larger part of the critical path due to the delays caused by the adders 
and saturators. Thus, by using 25 registers at a location proximate to the MDFE output 
instead of usmg one register located at the wput of the MDFE. the critical path of the 
MDFE and the Viterbi decoder is broken up into 2 approximately balanced components 
This architecture makes it possible to meet the very demanding timing requirements of the 
Gigabit Ethonet transceiver. 

Another advantageous fector in achieving high-speed operation for the treUis 
decoder 38 is the use of heavily truncated rt^nesentations for the metrics of the Viterbi 
decoder. Although this may result in a mmhematically non-zero decrease m theoretical 
performance, the resulting vestigial precision is nevertheless quite sufficient to support 
healthy error margins. Moreover, the use of heavily truncated rtg^resentations for the 

metrics of the Viterbi decoder greatly assists in achieving the requisite high operational 
speeds in a gigabit environment In addition, the reduced precision fecUitates the use of 
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random logic or simple lookup tables to compute the squared errors, i.e,, the distance 
metrics, consequently reducing the use of valuable silicon real estate for merely ancillary 
circuitry. 

FIG. 16 shows the word lengths used in one embodiment of the Viterbi decoder of 
this mvention. In FIG. 16, the word lengths are denoted by S or U followed by two 
numbers separated by a period. The first number indicates the total number of bits in the 
word length. The second number indicates the number ofbits after the decimal point The 
letter S denotes a signed number, while the letter U denotes an unsigned number. For 
example, each ID component of the 4D Viterbi input is a signed 5-bit number having 3 
bits after the decimal point 

FIG. 17 shows an exemplary lookup table that can be used to compute the squared 
1-dimensional errors. The logic fimction described by this table can be implemented using 
lead-only-memory devices, random logic circuitry or PL A circuitry. Logic design 
techniques well known to a person of ordinary skill in the art can be used to implement the 
logic fimction described by the table of FIG, 1 7 in random logic, 

FIGs. 18A and 18B provide a more complete table describing the computation of 
the decisions and squared errors for both the X and Y subsets directly fiom one component 
of the 4D Viterbi input to the ID slicers (FIG. 7). This table completely specifies the 
operation of the slicers of FIG, 7. 

An exemplary dotnodulator uxcluding a high speed decoder has been described and 
includes various components that fecilitate robust and accurate acquisition and decoding 
of PAM-5 constellation signals at speeds consistent with gigabit operation. Symbol 
decoding, including ISI compensation, is accurately performed in a symbol period of abotit 
8 ns, by a transceivo: demodulator circuit constructed in a manner so as to first, bifiircate 
the ISI compensation fimction between an FFE, operating to compensate partial response 
pulse shaping filter (remote transmitter) induced ISI, and a decoder operating to 
compensate ISI perturbations induced by transmission chaimel characteristics, and second, 
by bifiircating critical path computations into substantially balanced first and second 
portions, the first portion including computations performed in a DFE and MDFE element 
and a second portion including computations performed in a Viterbi decoder. 

Hie DFE element is fiirtiier advantageous in that it is implemented as only a single 
conceptual DFE (taking into account its 4D nature) rather than an eigjit element stadc, 
each of vliidi defines a multi-dimensional input to an eight-state Viterbi. The DFE is 
"stufifed", at particular chosen locations, by the first several stages of a sequential, multi- 
stage tentative decision path memory module, so as to develop a set of "tail" coefiScient 
values in the DFE viiiich, taken together, rqjresent the algebraic sum of a truncated set of 
DFE coeflScients Cj to C32. A received symtol, represented by a five level constellation, is 
convolved with the remaiiung two DFE coefiBcients, Cq and C„ vsiiich are taken to 
rqjresent the transmission diannel induced ISL 
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As deskcwed signals enter the decoder, the previous symbol, convolved with the 
DFE coeflScients C3 to Cjj, is first subtracted thercfipom. Then the previom symbol 
convolved with Q is subtracted and the resultant (intennediate) symbol is directed to the 
MDFE. This resultant signal might be described as the receive symbol vdth partial ISI 
introduced by previous symbols subtracted In the MDFE, all possible convolutions of the 
primary coefficients, Q and C„ with the possible symbol values, is subtracted from the 
intermediate symbol to provide a receive symbol without perturbations induced by ISI. 

It will be evident to one having skill in the art that although the transceiver has 
been described in the context of a trellis encoded, PAM-5 signal representation, 
communicated over a multi-pair transmission drnnnel, the invention is not limited to any 
particular communication technique. Specifically, the decoder architecture and signal 
processing methodology in accord vrith the invention is suitable for use with any form of 
communication in which the symbolic contoit of the communication is represented by 
multi-level signals. The invention, indeed, becomes particularly appropriate as the number 
of signal levels increases. 

Neither is the invration limited to signals encoded in accordance with a 4D, eight- 
state, trellis methodology. Trellis encoding forces flie system to be constructed so as to 
accommodate the eight states inherent in the trellis methodology. Other coding 
methodologies and architectures are expressly contemplated by the invention and can be 
implemented by making the proper modifications to an alternative coding architecture's 
"state width", as will be apparent to a skilled mtegrated circuit transceiver designer. 
Likewise, the "dimensional depth**, ID, 2D, 4D.... for example, may be suitably mcrcased, 
or decreased to accommodate different forms of transmission channel implemratations. 
As in the case of increasing signal level representations, the systems aiid methods of the 
invention are particularly suitable for chaimels with increased "depth", such as six, eight, 
or even higher numbers, of twisted pair cabling, single conductor cabling, parallel wireless 
diarmels, and the like. 

In the context of an exemplary integrated circuit-type bidirectional communication 
system, a further aspect of &e invmtion might be characterized as a system and method 
for adq>tively and dynamically regulating the power consumption of an integrated circuit 
communication system as a fimction of particular, user defined sigiud quality metrics. 
Signal quality metrics mi^t include a signal*s bit error rate (BER), a signal-to-noise ratio 
(SNR) specification, noise margin figure, dynamic range^ or the like. Indeed, signal 
quality is a generalized term used to describe a signal's fimctional fidelity. 

As will be understood by one having skill in the art, signal quality is a measurable 
operational characteristic of various componoit portions of modem commimication 
systems. Various forms of signal quality metrics are used to define the features and 
fimctionality of signal processing portions of integrated circuit comrnunication devices, 
particularly coder/decoder circuitry, equalizers and filters, eadi of \^ch require large 
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amounts of silicon real estate for effective implementation, and a consequently large 
degree of power consun^)tion during operation. 

Tximing now to FIG. 28, the inv^tion might be described briefly as a methodology 
for balancing the conflicting circuit performance requirements represented by signm 
quality and power consumption and might be illustrated as the implementation of a 
decision matrix having power consumption as one of the dimensions and a chosen signal 
quality metric as another. From FIG. 28, it will be understood that integrated circuit 
power consumption is directly related to processed signal quality. This is particularly true 
in the case of integrated circuits incoqx>rating high order digital filter elements, having a 
large number of taps, all of which consume power vAicn in operation. 

However, it has been genially accepted integrated circuit design practice to 
construct an integrated circuit commimication device to accommodate the most stringent 
digital processing that might be required by a device in an actual application. In the case 
of an EthOTiet transceiver, for example, provision must be made for processing signals 
transmitted over a wide variety of transmission channels exhibiting widely disparate 
transmission channel characteristics, ranging from extremely lossy, highly populated, long 
wiring run channels, to very short (<2 meters) point-to-point installations. In either case, 
all of the signal processing elements of conventional transceiver circuitry are operative to 
process a signal, \\diether needed or not, such that power consimiption is relatively 
constant and large. 

In FIG. 28, the evaluation matrix juc^es an ou^ut signal quality metric against a 
threshold standard, and where a measured quality metric is greater than the threshold, 
allows the power consumption of the device to be reduced by turning off various 
functional prccessing blocks mtil the ou^ut signal quality is reduced to the threshold 
value. This aproach has particular utility in the case of digital filter elements, 
coder/decoder circuitiy and equalizers, all of which include multiple elements that are 
reqmred for processing signals propagated through harsh channel environments, but to 
various degrees uimecessary when signals are propagated through a more benign channel. 

The evaluation matrix, as exemplified in FIG. 28, might be initialized by a user 
mput requirement, such as the degree to ^^4uch power consiunption is an issue. A 
particular power consimiption value mi^t be set as an operational parameter (indicated as 
*T" in FIG. 28), and portions of the device ad^tively turned off until the desired power 
value is readied. This will necessarily affect the signal qxiality of a signal processed by 
such truncated circuitry, but, in accordance with the invention, signal quality is able to be 
locally maximized to a pre-determined power consumption metric, such that device 
performance is not unduly sacrificed. 

Various portions of the device might be powered-down in predetermined 
sequential combinations with each combination resialting in a particular performance 
metric. Signal poformance is evaluated at each sequential step. Thus, any one power 
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consumption specification ,i.e. 'T", will give a range of perfbnnancc values (represented 
as "A" in FIG. 28). The best signal performance result is necessarily the chosen metric for 
deciding which of the multiplicity of power-down configurations is implementei 
Consequently, where power is the primary concern, signal quality defaults to the best 
signal performance achievable at the specified power level. 

Where signal quality (performance) is the primary concern, the system is allowed 
to fimction normally, with all processing blocks operative. In this circumstance, power 
consumption will be expected to be nominal. 

Where signal quality is desirable, but some accommodation must be made to 
power consumption, a user may set a signal quality metric as a threshold standard 
(mdicated as "Q" in FIG. 28), and aUow the system to ad^tively and dynamically run 
through a multiplicity of power-down configurations, resulting in a range of power 
consumption values (indicated as **B" in FIG. 28) in order to determine vsiiich of the 
configurations gives the lowest power consumption while retaining the desired signal 
quality metric. This methodology is particularly effective in high order filters with 
multiple taps, and in decoder blocks that might implement a trellis decoder in a fiilly 
fimctional form, but v/inch might be adequate when truncated to a simple slicer in certain 
situations. 

No matter how implemented, however, all that is required for practice of flie 
invention is that power consumption be established as one basis of an evaluation matrix, 
and that some signal quaUty or device performance characteristic, having a relationship to 
device power consumption, be estabUshed as another. As one of the bases are defined, as 
by a user input, for example, the other basis is locally maximized (in the case of 
performance) or minimized (in the case of power) by an ad^tive and dynamic procedure 
that chooses the most pertinent portions of an integrated circuit to disable. The procedure 
is adaptive in the sense that it is not fixed in time. As channel and signal characteristics 
can be e?q}ected to vary with time, a changing signal quality metric will force a re- 
evaluation of the matrix. A fiirtiiCT reduction of power consumption, or a finther 
enhancement of signal quality may be obtained. 

In order to appreciate the advantages of the present invention, it wiU be beneficial 
to describe the invention in the cont^ of an exemplary bidirectional communication 
device, such as an Ethemet transceiver. The particular exemplary implementation chosen 
is depicted in FIG. 1, whidi is a simplified block diagram of a multi-pair communication 
system operating in conformance wifli the IEEE 802.3ab standard (also termed 
lOOOBASE-T) for 1 gigabit (Gb/s) Etiiemet fiill^uplex communication over four twisted 
pairs of Category-5 copper wires. 

The communication system illustrated in FIG. 1 is rqiresented as a point-to-point 
system, in order to simplify the e}q>lanation, and includes two main transceiver blocks 1 02 
and 104, coi^led together via four twisted-pair cables 1 12a, b, c and d. Each of tfie wire 
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pairs 1 12a, b, c, d is coiq)led to each of the transceiver blocks 102, 104 through a 
respective one of four line interfiice circuits 106. Each of the wire paire 1 12a, b, c, d 
fecilitates conmiimication of information between corresponding pairs of four pairs of 
transmitter/receiver circuits (constituent transceivers) 108. Each of the constituent 
transceivers 108 is coupled between a respective line interface circuit 106 and a Physical 
Coding Sublayer (PCS) block 1 10, At each of the transceiver blocks 102 and 104, the four 
constituent transceivers 108 are capable of operating simultaneously at 250 megabits of 
information data per second (Mb/s) each, and are coupled to the corresponding remote 
constituent transceivers through respective line interface circuits to facilitate full-duplex 
bidirectional operatioit Thus, IGb/s communication throughput of each of the transceiver 
blocks 102 and 104 is achieved by using four 250 Mb/s (125 Mbaud at 2 information data 
bits per symbol) constituent transceivers 108 for each of the transceiver blocks 102, 104 
and foux pairs of twisted copper cables to connect the two transceiver blocks 102, 104 
togetiier. 

The exemplary communication system of FIG. I has a superficial resemblance to a 
100BASE-T4 system, but is configured to operate at ten times the bit rate. As such, it 
should be understood that certain system performance characteristics, such as sampling 
rates and the like, will be consequently higher and cause a greater degree of power 
consumption. Also, at gigabit data rates over potentially noisy channels, a proportionately 
greater degree of signal processing is required in many instances to insure an adequate 
degree of signal fidelity and quality. 

nG,2 is a simplified block diagram of the fimctional architecture and internal 
construction of an exemplary traixsceiver block, indicated generally at 200, such as 
transceiver 102 of FIG. 1 . Since the illustrative transceiver application relates to gigabit 
Ethernet transmission, the transceiver will be refered to as the "gigabit transceiver". For 
ease of illustration and description, FIG. 2 shows only one of the four 250 Mb/s 
constituent transceivers which are opoiating simultaneously (termed herein 4-D operation). 
Ho wevCT, since the operation of the four constituent transceivers are necessarily 
interrelated, certain blocks and signal lines in the exemplary embodiment of FIG. 2 
perform four-<iimensional operations and carry four-dimensional (4-D) signals, 
respectively. By 4-D, it is meant that the data fit>m the four constituent transceivers are 
used simultaneously. In order to clarify signal relationships in FIG. 2, thin Imes 
correspond to 1 -dimensional fimctions or signals (i.e., relating to only a single constituent 
transceiver), and thick lines correspond to 4-D fimctions or signals (relating to all four 
constituent transceivos). 

Referring to FIG. 2, the gigabit transceiver 200 includes a Gigabit Medium 
Independent Inter&ce (GMII) block 202 subdivided into a receive GMII circuit 202R and a 
transmit GMII circuit 202T. ITie transceiver also includes a Physical Coding Sublayer 
(PCS) block 204, subdivided into a receive PCS circuit 204R and a transmit PCS circuit 
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204T, a pulse shaping filter 206, a digital-to analog (D/A) converter block 208, and a line 
interface block 210, aU generaUy encompassing the transmitter portion of the transceiver. 

The receiver portion generally includes a highpass filter 212, a programmable gain 
amplifier (PGA) 214, an analog-to-digital (A/D) converter 216, an automatic gam control 
(AGC) block 220, a timing recovery block 222, a pair-swap multiplexer block 224. a 
demodulator 226. an oflfeet canceUer 228, a near-end crosstalk (NEXT) canceller block 
230 having three constituent NEXT cancellers and an echo canceller 232. 

The gigabit transceiver 200 also mcludes an A/D fiist-in-firet-out buffer (FIFO) 
218 to feciUtate proper transfer of data fiom the analog clock region to the receive clock 
region, and a loopback FIFO block (LPBK) 234 to feciUtate proper transfer of data from 
the transmit clock region to the receive clock region. The gigabit transceiver 200 can 
optionally include an additional adaptive filter to cancel fer-end crosstalk noise (FEXT 
canceUer). 

In operational terms, on the transmit path, the transmit section 202T of the GMII 
block receives data from the Media Access Control (MAC) module m byte-wide format at 
the rate of 125 MHz and passes them to the transmit section 204T of the PCS block via the 
FIFO 201 . The FIFO 201 ensures propo: data transfer bom the MAC layer to the Physical 
Coding (PHY) layer, since the transmit clock of tihe PHY layer is not necessarily 
synchronized with the clock of the MAC layer. In one embodiment, this small FIFO 201 
has fiom about three to about five m«nory cells to accommodate the file elasticity 
requiicment -wWch is a fimction of frame size and frequency ofi&et 

The PCS transmit section 204T performs certain scambling operations and, in 
particular, is responsible for encoding digital data into the requisite codeword 
representations appropriate for transmission. In, the illustrated embodiment of HG. 2, the 
transmit PCS section 204T incorporates a coding engine and signal mapper that 
implements a trellis coding architecture, such as required by the IEEE 802.3ab 
specification for gigabit transmission. 

In accordance with tihis encoding architecture, Ae PCS transmit section 204T 
generates four 1-D symbols, one for each of the four constiturait transceivers. The 1-D 
symbol generated for the constituent transceiver depicted in HG. 2 is filtered by the pulse 
shaping filter 206. Hiis filtering assists in reducing the radiated emission of the output of 
the transceiver such that it fidls within the parameters required by the Fedcsal 
Communications Commission. The pulse shaping filter 206 is implemented so as to 
define a transfer fimction of 0.75 -K).25z-'. This particular implementation is chosen so 
that thepower spectrum of the ou^ut of the transceiver falls below the power spectrum of 
a 100Base-TX signal. The lOOBase-Tx is a widely used and accepted Fast Ethernet 
standard for 100 Mh/s operation on two pairs ofCategory-5 twisted pair cables. The 
ou^ut of the pulse shaping filter 206 is converted to an analog signal by the D/A converter 
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208 operating at 125 MHz. Hie analog signal passes through the line inteifece block 210, 
and is placed on the corresponding twisted pair cable. 

On die receive path, the line interfece block 210 receives an analog signal fiom the 
5 twisted pair cable. The received analog signal is preconditioned by the highpass filter 212 
and the PGA 214 before being converted to a digital signal by the A/D converter 216 
operating at a sampling rate of 125 MHz. The timing of the A/D converter 216 is 
controlled by the output of the timing recovery block 222. The resulting digital signal is 
properly transferred firam the analog clodc region to the receive clock region by the A/D 

10 FIF0218. The output ofthe A/D FIFO 218 is also used by the AGC 220 to control the 
operation of the PGA 214. 

Hie ou^ut of the A/D FIFO 218, along with the outputs from the A/D FIFOs ofthe 
other three constituent transceivers are inputted to the pair-swap multiplexer block 224. 
TTie pair-swap multiplexer block 224 uses the 4-D pair-swap control signal fiom the 
15 receive section 204R of PCS block to soil out the four input signals and send the coircct 
signals to the respective feedforward equalizers 26 ofthe demodulator 226. This pair- 
swapping control is needed for the fiallowing reason. The trellis coding methodology used 
for the gigabit transceivers (102 and 104 of FIG. 1) is based on the fact that a signal on 
each twisted pair of wire corresponds to a respective 1-D constellation, and tiiat the signals 
20 transmitted over four twisted pairs collectively form a 4-D constellation. Thus, for the 
decoding to woric, each ofthe four twisted pairs must be uniquely identified with one of 
the four dimensions. Any undetected swappmg of the four pairs would result in erroneous 
decoding. In an alternate embodiment of the gigabit transceiver, the pair-swappmg control 
is performed by the demodulator 226, instead ofthe combination ofthe PCS receive 
25 section 204R and the pair-swap multiplexer block 224. 

The demodulator 226 includes a feed-forward equalizer (FFE) 26 for each 
constituent transceiver, coiq)led to a deskew memory circuit 36 and a decoder circuit 38, 
implemented in the illustrated embodiment as a trellis decoder. The deskew memory 
circuit 36 and the trellis decoder 38 arc common to all four constituent transceivers. The 
30 26 receives the received signal intended for it fiom the pair-swap multiplexer block 

224. The FFE 26 is suitably implemented to include a precursor filter 28, a programmable 
inverse partial response (IPR) filter 30, a summing device 32, and an adaptive gain stage 
34. The FFE 26 is a least-mean-squares (LMS) type adaptive filter which is configured to 
perfi)rm diannel equalization as will be described in greater detail below. 
35 The precursor filter 28 generates a precursor to tiie input signal 2. This precursor is 

used for timing recovery. The transfer fiinction of the precursor filto- 28 mi^t be 
represented as -y4^', with y equal to 1/16 for short cables (less tiian 80 metere) and 1/8 
for long cables (more than 80 m). The determination of tfie Icaigtii of a cable is based on 
tfie gain of the coarse PGA 14 of the programmable gain block 214. 
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The programmable IPR filter 30 compensates the ISI (intersymbol interference) 
introduced by the partial response pulse shaping in the transmitter section of a remote 
transceiver which transmitted the analog equivalent of the digital signal 2. The transfer 
fimction of Ae IPR filter 30 may be e)q)ressed as 1/(1+Kz''). In the present example, K 
has an exemplaiy value of 0.484375 during startup, and is slowly ramped down to zero 
after convergence of the decision feedback equalizer included inside the trellis decoder 38. 
The value of K may also be any positive value strictly less than 1 , 

The summing device 32 receives the output of the IPR filter 30 and subtracts 
therefrom adaptively derived cancellation signals received fi:om the adaptive filter block, 
namely signals developed by the ofibet canceller 228, the NEXT cancellers 230, and the 
echo canceller 232, TTie ofl&et canceller 228 is an adaptive filter which generates an 
estunate of signal ofi&et introduced by con^>onent circuitry of the transceiver's analog 
fit)nt end, particularly oflfeets introduced by the PGA 214 and the A/D converter 216. 

The fluee NEXT cancellers 230 may also be described as adaptive filteis and are 
used, in the illustrated embodiment, for modeling the NEXT impairments in the received 
signal caused by interference generated by symbols sent by the three local transmitters of 
the other three constituent transceivers: These impairments are recognized as being caused 
by a crosstalk mechanism between neighboring pairs of cables, thus the term near-end 
crosstalk, or NEXT. Since each receiver has access to the data transmitted by the other 
three local transmitters, it is possible to approximately replicate the NEXT impairments 
through filtering. Referring to HG. 2, the three NEXT cancellers 230 filter the signals sent 
by the PCS block to the other three local transmitters and produce three signals replicating 
the respective NEXT impairments. By subtracting these three signals from the output of 
the DPR filter 30, the NEXT impairments are ^proximately cancelled. 

Due to the bi-directiorud nature of the diannel, each local transmitter causes an 
echo impairment on the received signal of the local receiver with vMch it is paired to form 
a constituent transceiver. In order to remove this impairment, an echo canceller 232 is 
provided, which may also be diaracterized as an adaptive filter, and is used, in the 
illustrated embodiment, for modeling the signal impairment due to echo. The echo 
canceller 232 filters flie signal sent by the PCS block to the local transmitter associated 
with flie receiver, and produces an approximate replica of the echo impairment By 
subtracting this replica signal fix)m the ou^ut of the IPR filter 30, the echo impairment is 
^proximately cancelled. 

The adaptive gain stage 34 receives the processed signal from the summing circuit 
32 and fine tunes the signal path gain usirig a zero-forcing lAlSalgoritiu^ Since this 
adaptive gain stage 34 trains on the basis of error signals generated by the adaptive filters 
228, 230 and 232, it provides a more accurate signal gain than the one provided by the 
PGA 2 14 in the analog section. 
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The output of the ad^tive gain stage 34, ^\*ich is also the output of the FFE 26, is 
inputted to the deskcw memoiy circuit 36. The deskcw memory 36 is a four-dimensionai 
fimction Mode, i.e., it also receives the ou^uts of the three FFEs of the other fhrec 
constiUient transceivers. There may be a relative skew in the outputs of the four FFEs, 
w*ich are the four signal samples representing the four symbols to be decoded. This 
relative skew can be up to 50 nanoseconds, and is due to the variations in the way the 
copper wne pairs are twisted. In order to correctly decode the four symbols, the four 
signal samples must be proporly aligned. The deskew memoiy aligns the four signal 
samples received from the four FFEs, then passes the deskewed four signal samples to a 
decoder circuit 38 for decoding. 

In the context of the exemplary embodiment, the data received at flie local 
transceiver was encoded before transmission, at the remote transceiver. In tiie present 
case, data might be encoded using an 8-state four-dimensional ttellis code, and the decoder 
38 might therefore be unplemented as a trellis decoder. In die absence of intersymbol 
mterference (ISI), a proper 8-state Viterbi decoder would pix)vide optimal decodmg of this 
code. However, in the case of Gigabit Ethernet, the Category-5 twisted pair cable 
introduces a significant amount of ISI. In addition, the partial response filter of the remote 
transmitter on the other end of the communication channel also contributes some ISI. 
Therefore, the trellis decoder 38 must decode both the trellis code and the ISI, at the high 
rate of 125 MHz. In the illustrated embodiment of the gigabit transceiver, the trellis 
decoder 38 includes an 8-state Viterbi decoder, and uses a decision-feedback sequence 
estimation ^proach to deal with flie ISI components. 

The 4-D output of the trellis decoder 38 is provided to the PCS receive section 
204R. The receive section 204R of the PCS block de-scrambles and decodes the ^mbol 
stream, then passes the decoded packets and idle stream to the receive section 202T of the 
GMn block viiiich passes them to the MAC module. The 4-D outputs, which are the error 
and teitative decision, respectively, are provided to the timing recovery block 222. whose 
ou^ controls the sampling time of die A/D converter 216. One of the four components 
of tiie error and one of the four components of the tentative decision correspond to the 
receiver shown in FIG. 2, and are provided to tiie adaptive gain stage 34 of the FFE 26 to 
a^ust fhe gain of ttie equalizer signal patt. The error component portion of the decoder 
ou^ signal is also provided, as a control signal, to ad^tation circuitry incorporated in 
each of the adaptive filters 228. 229, 230, 231 and 232. Adaptation circuitry is used for 
the updating and training process of filter coefficients. 

FIG. 3 is a block diagrani of tiie trellis decoder 38 of FIG. 2 The hellis decoder 38 
includes a multiple decision feedback equaUzer (MDFE) 602, a Viterbi decoder 604, a 
path metiics module 606, a path memoiy module 608, a select logic 610, and a decision 
feedback equalizer 612. Here are eight Viteri)i inputs and eight Viterbi decisions 
cotresponduig to die eight states. Eadi of die eight Viterbi inputs (respectively, decisions) 
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is a 4-dimcnsional vector whose foiir components arc the Viterbi mputs (respectively, 
decisions) for the four constituent transceivers, respectively. 

The adaptive filters used to implement the echo canceller 232 and the NEXT 
cancellers 229, 230 and 23 1 are typically finite impulse response (FIR) filters. FIG. 29 A 
shows a structure of an adaptive FIR filter used as an echo/NEXT canceUer in on^ 
embodiment of the gigabit transceiver. 

Referring to FIG; 29A. the adaptive FIR filter includes an mput signal path an 
output signal path P^, and N taps (N is nine in FIG. 29A). Each t^ connects a point on 
the input signal path to a pomt on the output signal path P^. Each tap, except for the 
last tap, includes a coefficient Cj, a multiplier and an adder A^, i=0,.„,N-2. The last tap 
includes the coefficient CVi, the multipUer M^j, and no adder. The coefficients C;, \^ere 
, are stored in coefficient registers. During each adaptation process, the values of 
the coefficients C, arc trained using a well-known least-mean-squares algorithm by an 
adaptation circuitry (not shown m FIG. 29A). After training, the coefficients C. converge 
to stable values. The FIR filter includes a set of delay elements D^, conventionally denoted 
by z * in FIG. 29A. The number of delay elements D; determines the order of the FIR 
filter. The output y(n), Le., the filter output at time instant n, is a fimction of the input at 
time instant n and of the past irq)uts at time instants n-1 through n-(N-l), and is expressed 
as: 

where x(n-i) denotes the input at time instant n-i, and N denotes the nxmiber of taps. The 
output y(n), as shown m Equation (1), is a weighted sum of the input data x(n-i), with 
i=K),...,N-L The coefficients Q act as the weighting factors on the input data. If a 
coefficient Cj has a very small absolute value, relative to the values of other coefficients, 
dien the contribution of the corresponding input data x(n-i) to the value of y(n) is relatively 
insignificant 

FIG. 29B is an equivalent structure of the filter shown in FIG, 29A. The two 
structures m FIGs. 29A and 29B provide the same filter transfer fimction, but differ in 
certain performance characteristics. The difference is due to the placement of tiie delay 
Clements D., i=l,.. Jl-l (N=9 in HGs. 29A, 29B). If all tfie delay elements arc placed m 
the input pafli P^^ as in the well-known direct form of tiic FIR filter, then the registas that 
arc used to implement the delay elements are small, need only to be of fixe same size as the 
input data x(n). If all the delay el«nents arc placed on the output path P«^ as in flie well- 
known transposed form of the FIR filter, then the registers used as tiie delay elements must 
have more bits in order to hold the largest possible sum of products C>x(n-i). Large 
registers cost more and consume more power than small registers. Thus, the advantage of 
placing fte deh^ elemmts on the input path instead of the ou^ut path is that fewer 
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register bits arc required. However, the larger the number of the delay elements on the 
input path, the lower the operating speed of the filter is. 

If the propagation delay firom the input of the filter to the last tap exceeds the 
required clock period, then the filter is not usable. To break the long propagation delay, 
that would occur if all the delay elements were placed on the input pafli P^^, into small 
delay interval^ some of the del^ elements are placed on the output path P^, at regular 
intervals, as shown in the filter structures in FIGs. 29A and 29B. The structure in FIG. 
29B, v^ch has a "two-to-one" split of delay elements between the input path and the 
output path, can operate at a higher clock speed than the structure in FIG. 29 A, which has 
a "three-to-one" split Computational results show that both of these structures are 
acceptable for use in a high-speed system such as the gigabit transceiver. 

The taps of the adaptive FIR filters used in the gigabit transceiver can be switched 
from an active state to an inactive state. FIG. 29C shows a modification to the structure of 
FIG. 29B to bypass a deactivated tap. 

Referring to FIG. 29C, the filter structure includes a bypass circuit for each adder 
A^, i^,,.. Each bypass circuit includes a gate Gj indicated as an AhfD gate, and a 
multiplexer Uj. Also associated with each bypass circuit is a control signal S, which 
indicates the active or inactive state of the tap having tfie coeflBcient C; and the adder A^. 
Si is set equal to one if the tap is intended to be active, and set equal to zero if the tap is 
intended to be inactive. When 8;=!, the output of any arbitrarily chosen gate Gj is equal to 
the data signal at the input of that gate G^. At the corresponding multiplexer Uf, m the case 
Miierc Si=l, only the output signal fix)m the adder A^ is outputted by the multiplexer. In 
the case where Si=0, the ou^ut of gate G^ is zero, and the data signal at the input of gate Gj 
flows to the multiplexer via the corresponding bypass connection Bj, bypassing the 
adder A,., At the multiplexer U,-, due to Si=0, only the data signal from the bypass 
connection B^ is outputted. 

The foregoing is only one exemplary implementation of a filter configuration 
v^iierein taps can be switched between active and inactive states. An alternative 
implementation is one whore the multipliers M, coupled to receive filter coefiBcients from 
associated coefficient registers are able to be switched between active and inactive states. 

FIG. 29D is a semi-schematic block diagram of a multiplier 2900, such as might be 
associated witii each tap coefficient The multiplier 2900 is configured to receive a 
coefficient word, from a corresponding coefficiait register. The coefficient word is 
received in a multiplexer circuit 2902, vAnch receives the coefficient in two 
configurations: a first **raw" configuration taken dkecfly from flie coefficient register, and 
a second "times 2" configuration taken fi^m tiie register but shifted one position to the 
left The second coefficient configuration, then, represents the "raw*' value multiplied by 
two. Since the second coefficient configuration is a shifted one and, necessarily contains 
one fewer bit than the "raw" coefficient, the "times two" coefficient set is padded by the 
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bit value 0 (this is done by tying the least significant bit to V^s, \A)ich is ground). This is a 
particularly efiScient implementation of a multiplier which takes advantage of the fact that 
tiie symbols can only have the values {-2, -1, 0, +1, +2}. The symbols are represented by 
tfiree bits in sign-magnitude representation, with bit 2 indicating the sign (+ or 

A select OR gate 2904 "ORs" an OFF signal with the value of symbol bit 0 to 
select which coefiScient representation is selected to pass through the multiplexer 2902. 
When the value of symbol bit 0 is 1, the **raw** coefficient, representing either -1, 0, +1 is 
selected. When OFF is equal to logical 1, the same condition applies. The coefiScient 
selected by multiplexer 2902 is directed to one input of an XOR gate whete it is 
exclusively "ORed" with an output signal fix)m a select AND gate 2908. The AND gate 
2908 "ANDs" an inverted OFF signal with the symbol bit 2 value. When OFF is logical 0, 
i,e., inverted OFF is logical 1, and when symbol bit 2 is 1, the XOR fimctions to designate 
that the sign of the coefficient is negative. It should be understood that the XOR is 
configured as a stack (of 10 individual XOR gates), and that manipiilation of the cany bit 
determines the sign of the coefficients. 

The signed coefficient is directed to an additional AND gate 2910, where it is 
"ANDed" wifli the output of a second select AND gate 2912, The output of second select 
AND gate 2912 is developed by "ANDing** the inverted OFF signal with the "ORed" 
result between symbol bits 0, 1 and 2. The effective fimction of OR gate 2914 is to 
differ^ate between the symbol zero value and the other symbol values. In effect, OR 
gate 2914 is a symbol {0} detect circuit 

Tap disablanent is a fimction of the OFF signal value. When OFF is logical 1 , the 
multiplexer is set to select "one" ie., the ^'mw" coefficient When OFF equal to logical 1, 
inverted OFF is logical 0, causing the first and second select AND gates 2908 and 2912 to 
output a zjero regardless of the value of the symbol bit iiqiut Since the oiitput of AND 
gate 2912 is zero, the AND gate stack 2910 also outputs a zero, which is directed to a 
corresponding tap addra- A, in the ou^ut path of the ad^tive filter (FIGS. 29A, 29B or 
29C). Adding a zem reqiiires no conq)Utation and the tap is thus effisctively deactivated. 

The underlying reason for ORmg the OFF signal in the OR gate 2904 and for 
ANDing the inverse OFF signal in the AND gate 2908 is to aisure that no transitions take 
place mside tiie multiplier v/hen the tap is deactivated. Without the OFF signal as input to 
tiie OR gate 2904, the select input to the multiplexer 2902 will toggle dependmg on the 
value of the symbol bit 0. Mthout the inverse OFF signal as input to the AND gate 2908, 
one of the two inputs to the XOR 2906 will toggle depending on tiie value of the symbol 
bit 2. This toggling, or transition, would dissipate powCT. Hie reason for ANDing the 
mverse OFF signal in the AND gate 2912 is to ensure that the multiplier oxitput (vrfaich is 
tiie ou^ut of AND gate 29 10) is zero when the t^ is deactivated. 

Referring back to FIG. 2, flie adaptive FIR filters used as the echo canceller 232 
and die three NEXT cancellers 229, 230 and 231, require large numbers of taps to be 
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eflfective as canceUers for a wide range of twisted pairs of cables. Echo/NEXT responses 
differ for different cables, and require diflFerent taps in the cancellers to model them. 
Therefore, canceUers are built with enough t^s to provide adequate cancellation with the 
worst-case expected cable responses. For example, in the Ulustrated embodiment of the 
gigabit transceiver of FIG. 2, each echo canceUer has one hundred ninety two (192) taps, 
and each NEXT canceller has thirty six (36) taps (it is noted that there is also a total of 132 
taps in the DFE which are always active). Since there are four echo canceUers (one per 
constituent transceiver) and twelve NEXT canceUers (three per constituent transceiver) in 
the gigabit transceiver, the total number of taps Ujat can be activated or deactivated in die 
gigabit transceiver is twelve hundred (1200). When active, each of these taps consumes a 
ioaaU amount of power. Due to their large number, if all of the taps are active at flie same 
time, tfKsir individual power consumption values wiU sum to significantiy large total power 
consumption figure. This power consumption, if not regulated, generaUy causes a high 
degree of localizsed heating in an integrated circuit; often resulting in reliability issues, 
skewed circuit petfonnance and, in some cases, catastrophic device feilure. 

Regulation of tiiis power consumption is possible smce not aU of the taps are 
required to be active on any given channel at any given time. The taps that are not 
required to be active arc the ones fliat do not significantiy contribute to tiie petfonnance of 
the system. However, which taps are not required to be active at a given time is not 
known a priori Such unnecessaiy taps can become needed at a diflFerent time due to 
dynamic changes in the cable response. The present invention dynamicaUy determines 
which, if my, taps are unnecessary for adequate performance in a particular appUcation, 
and deactivates Uiem. ITie present invention also re-activates any previously deactivated 
taps that subsequenfly become necessary, due to changes in tiie cable response, for system 
performance improvement As appUcd to the adaptive filters, the method of the present 
invention might be characterized as a tap power regulation metiiod. 

FIG. 30 is a flowchart of a first exemplary embodiment of a method for 
implementing principles of die present invention. A specified error and a specified power 
arc provided. They may be specified by a user. Hie specified power represents the 
maximum power consumption tiiat is allowed. If no power is specified, it is assumed to be 
infinite. The specified error represents the maximum degradation of the system 
performance that is aUowed and is preferably expressed as a mean squared error (MSE). 
Since tiie signal power is constant, die MSE corresponds to a ratio of mean squared error 
to signal(MSE/signal)usuaUy e^qjressed in decibels (dB). 

In FIG. 30, before tiie start of process 3000, no coefficient is active. Upon start 
(block 3(K)2), process 3000 mitiaUzes a tiireshold to a value (block 3004). TTiis initial 
value of tiie tiireshold can result &om a simulation test, or can be equal to ttie minimum 
absolute value of a tq» coefficient (as known fiom past experiments). This value is not 
critical as long as it is suffidentiy low to avoid a large degradation of tiie systein 
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peifonnance. The laps in a first block are activated (block 3006). The size of this first 
block. i.e., the number of taps in die first block, depends on the appUcation. In one 
application, this number is 120. the coefficients of the active taps are trained with the 
LMS algorithm until convergence (block 3008). 

The absolute values of the active tap coefficients arc compared with the threshold 
(block 30!0). The tqjs whose absolute values are less than the threshold are deactivated 
(block 3012). An error metric, typically a mean squared error (MSE) corresponding to a 
ratio of mean squared error to signal, and a poww metric arc computed (block 3014). 
Process 3000 flien checks whether a first test is satisfied (block 3016). In the first 
embodiment of the invention, this first test is satisfied when the error metric is greater than 
the specified error and the power metric is smaUer than the specified maximum power. If 
the error metric is greater than the specified error, this implies fliat tiie threshold has been 
set too high, causing too many taps to be deactivated, and tiiis has degraded the system 
perfijrmance by more than the specified amount. If the first test is satisfied, then tire 
threshold is decreased (block 3018). and all the taps in tiie block being considered are 
activated again (block 3006) and process 3000 proceeds wifli a lower tiueshold. 
Otherwise, process 3000 determines whether all the taps of the filter have been considered 
(block 3020). If not, then the next block of taps is considered, and this new block of taps 
is activated (block 3006). A typical size of this next block of taps is 20. All of the active 
t^ coefficients, including the new activated tap coefficients, are converged with an LMS 
algoritiim (block 3008) and process 3000 proceeds as described above. 

If all of the t^s have been considered, then process 3000 checks whether a second 
test is satisfied (block 3024). In tiie first embodiment of flie invention, the second test is 
satisfied when the error metric is smaller than the specified error or tiie power metric is 
larger tiian the specified power. Iftiie error metric is smaller than tiiie specified error, tiiis 
impUes tiiat it is possible to increase the tiireshold to deactivate mor« taps and still meet 
She system performance requirement Ifthe power metric is greater tiian the specified 
power, flien tiie tiireshold must be increased to lower the power consumption, regardless of 
the system performance requirement If tiie second test is satisfied, tiien tfie timshold is 
increased (block 3026) and tiie active taps arc compared witii tiie updated tiireshold (block 
3010). Oflierwise, process 3000 turns off tiie power on tiie taps fliat are subsequent to tiie 
tap which has the last highest ordered active coefficient (block 3028). In otiier words, if 
Cfc is tiie last highest ordered active coefficient, tiien aU tiie taps tiiat have tiie deactivated 
coefficients Q^itiiroughCN., are powered down. More details on tiie power down 
function in block 3028 arc provided below. Process 3000 tiien terminates (block 3030). 

When process 3000 is restarted (block 3032), a block of taps is activated (block 
3006). Upon restart of process 3000. tiie tiireshold is at its last value from tiie last 
qjplication of process 3000. The coefficiaits tiiat were previously deactivated are 
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activated with their values remaining at their last values before deactivation. Then process 
3000 proceeds as described above. 

Periodic restart ofprocess 3000 is desirable for the following reason. In some 
cases, the echo/NEXT path impulse response may change during normal operation. For 
example, this change may be a result of temperature changes. To correct for fliis change, 
process 3000 periodically restarts to turn on Ae deactivated coefficients in a sequential 
manner (block 3006), re-converges the coefficients (block 3008), and determines whether 
the previously deactivated coefficients arc stiU below the threshold (block 3010). If the 
previously deactivated coefficients are now converged to values above the threshold, they 
remain active, otherwise they are deactivated (block 3012). Any of the mitially active 
coefficients that now M below the direshold are also deactivated (block 3012). 

The underlying reason for activating die taps a few at a time (block 3006 through 
3020) is the following. When the total number of taps is very large, the power 
consumption can be very large during the initial convergence transient. This peak power 
consumption is very undesirable, and is unaffected by die tap power regulation process 
(which can only reduce the average power consumption of the filters). One solution to this 
peak power consumption problem is to activate and converge die taps in an initial small 
block of taps (blocks 3006, 3008), deactivate some of the converged taps according to a 
criterion (block 3010 tiirough block 3020), activate a next block of taps (block 3006), 
converge all the active taps including the newly activated taps (block 3008), and repeat the 
process of deactivation, activation and convergence until all die taps of tire filter are 
processed. 

Power-down block 3028, which is optional, ofprocess 3000 helps fiirther reduce 
the power consumption oftiie adaptive filters. Without block 3028, alfliough the tap 
power regulating process 3000 already achieves a large reduction of the power 
consumption by reducing die number of active taps, tiiere is still a significant amount of 
power dissipated by the long dehiy line oftiie adaptive filter. By delay line, it is meant tiie 
line connecting die delay elements togetiier. Turning a tap off does not necessarily affect 
the configuration of the delay Ime. However, \n many practical cases, many of the 
deactivated taps are located contiguously at the highest-ordered end oftiie filter. An 
example of sudi a case is when die cable is short and well behaved. In such cases, the 
portion oftiie delay line associated witii these contiguously deactivated taps can be 
completely powered down witiiout affecting tiie transfer function of the filter. This 
powering down contributes an additional reduction of power dissipation oftiie filter. In 
one exemplary application, this additional reduction of powCT dissipation is approxunately 
300 milliwatts (mW) per echo cancello: and 70 mW per NEXT canceller, resulting in a 
power saving of 2.04 Watts for flie gigabit tiansceivCT. 

An exemplary implementation of block 3028 is as follows. An additional bit, 
called tiie delay line enable bit, is associated witii each tap of a filter. This bit is initially 
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ON. When process 3000 reaches block 3028, aU of the taps are scanned for active status 
starting from the highest-ordeied end of the filter. i.e., the tap including the coefficient 
„ towards the lowest-ordered end, i.e., the tap including the coefficient Co- During 
scanning, the delay line enable bits of the scanned inactive taps are switched OFF until tiie 
first highest-ordered active tap is found. At tiiis point, tiic scanning for lap active status 
terminates. Then all tiie delay line sections corresponding to die taps whose delay line 
enable bits are OFF are powered down. 

Activation block 3006 of FIG. 30 is applied sequentially to tiie echo canceller 232 
and flietiiree NEXT cancellers 229. 230 and 231 (ofnG.2). FIG.31 iUustrates tiie 
flowchart of one exemplary embodiment of the activation block 3006. 

Referring to HG. 31, upon start (block 3102), tiie process 3006 sets tiie filter 
number to zero (block 3104) to operate on tiie echo canceUer. The filter number zero 
represents tiie echo canccUer, while filter numbers 1 tiirough 3 represent tiie tiiree NEXT 
canceUers, respectively. Process 3006 tiien sets tiie address and tiie end equal to tiie start 
address and tiie end address of tiie block of taps, respectively (block 3 106). The modules 
TapOn and Tap PowerUp are invoked wifli tfie address as argument (block 3 108). The 
module TapOn turns on tiie circuitry of tiie tap havmg flie specified address. This circuitry 
includes a l-bit storage to indicate tiie active status of tiie tap. When tiie tap is turned on, 
tiie tap is included in tiie computation of tiie output y(n) of tiie filter (referring to Equation 
(1)), and in tiie adaptation process, i.e.. tiie traimng and convergence of tiie filter 
coefficients. The module TapPowerUp turns tiie power on for tiie delay line section 
associated witii tiie tap having tiie specified address. Process 3006 tiien determines 
v^etfier tiie address is equal to tiie end. If it is not, flicn tiie address is increased by one 
(block 31 12), to consider tiie next tap of tiie filter. If tiie address has reached tiie end 
address of tiie block of taps, tiien process 3006 determmes vs^ettier filter number is equal 
to 3, Le., Ti^^ether all the filters in the transceiver have been considered (block 3114). If 
not, tiien filler number is increased by one. so tiiat tiie next filter is considered. If process 
3006 has operated on all tiie filters, tiien process 3006 sets tiie start address equal to tiie 
old end address, and sets tiie new end address equal to tiie sum of tiie old end address and 
flie Mode size, tiie block size being ttie size of tiie next block of taps to be activated (block 
3118). Process 3006 tiien terminates (blodc 3 120). 

Deactivation block 3012 of HG. 30 is appUed sequentiaUy to tiie echo canceUer 
232 and tiie tiiree NEXT canceUers 230 (of FIG. 2). FIG. 32 Ulustrates tiie flowchart of 
one embodiment of the deactivation block 3012. 

Referring to FIG. 32, upon start (block 3202), tiie process 3012 sets tiie filter 
number to zero (block 3204) to operate on tiie echo canceUer. The filter number zero 
represents tiie echo canceUer, vAulc filter numbers 1 tiirough 3 represent tiie tiuee NEXT 
canceUers, respectively. Process 3012 tiien sets flie address equal to zero and tiie end 
equal to flie lengtii of tiie filter minus 1 (block 3206). If tiie absolute value of tiie tap 
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coefficient at the specified address is less than T, the threshold, then the module TapOn is 
invoked to turn off the circuitry associated with the tap having the specified address (block 
3208). When the tap is turned ofE; the tap is removed fix)m the computation of the ou^ut 
y(n) of the filter (referring to Equation (1)), and fiom the adaptation process, i.e., the 
trainmg and convergence of the filter coefficients. Process 3012 dien deteimmes whether 
flie address is equal to the end. If it is not, then the tap address is increased by one (block 
3212), to consider the next tap of the filtw. If flie address has reached the end of the filter 
taps, ±ea process 3012 determines whether filter number is equal to 3, i.e., \rfiether all the 
filt«s in the transceiver have been considered (block 32 14). If not, then filter number is 
increased by one, so that the next filter is considered (block 3216). If process 3012 has 
operated on all the filters, then process 3012 terminates (block 3218). 

aior-computing blodc 3014 of FIG. 30 is applied sequentially to die echo 
canceUer 232 and the three NEXT cancellers 230 (of FIG. 2). FIG. 33 iUusliates the 
flowdiart of one embodiment of the oror-computing block 3014. 

Referring to FIG. 33, upon start (block 3302), die process 3014 sets the filter 
number to zero (block 3304) to operate on the echo canceller, and initializes flie error 
metric MSE, the power metric and flie flag. The filter number zero represents the echo 
canceller, while filter numbers 1 through 3 represent the three NEXT cancellers, 
respectively. Process 3014 then sets the address equal to the length of die filtor minus 1 
(block 3306) to scan flie filter taps &>m the highest ordered end. The reason for using this 
scanning order and the flag is to ensure that the taps diat will be powered down in block 
3028 of FIG. 30 will be excluded fiom die computation of die power metric. A 
deactivated tap still consumes a small amount of power if it is not actually powered down 
because of the associated delay line section. To compute the new power metric such that it 
can be used to accurately regulate die power consumption of the system, die process 3014 
must exclude fi»m the computation the power consumption of a deactivated tap that will 
be powered down. 

If TqiOnCaddr] is zero, i.e., if die tap at die specified address is turned oflF, tiien 
process 3014 computes the new error metric MSE by adding to die previous value of MSE 
the squared value of the tap coefficient at die specified address. Otiierwise, if die tap at die 
specified address is on, flien die flag is set to 1. Iftiie flag is l,tiien process 3014 
conq)utes the new powea: metric by adding to die previous value of die power metric the 
estimated power consumption TapPower of the tap having die specified address (block 
3308). TapPower is chosOT fiom precomputed values stored in a look-iq) table. These 
precomputed values are functions of die size of die coefficients and of die active or 
inactive status of the coefficient 

Process 3014 determines whedica: die address is 0 (block 3310). If it is not, flien 
the tap address is decreased by one (block 33 12), to consider die next tap of die filter. If 
die address has readied 0, dien process 3014 detCTmines whedier filter number is equal to 
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3, i.e.. whether aU the filters in the transceiver have been considered (block 33 14). If not, 
then filter number is increased by one, so that the next filter is considered and the flag is 
reset to 0 (block 3316). If process 3014 has operated on aU the filter, then process 3014 
terminates (block 3318). 

As shown in FIG. 33, the enor metric MSE is computed by summing the squared 
values of the deactivated tap coefficients. It is noted that the error metric can be computed 
dififerently. such as deriving it from the error component 42A of the 4-D error signal 42 
ou^utted fiom file trellis decoder 38 O'IG. 2). 

The MSE as measured fiom the error output 42 of the trcUis decoder 38 (FIG. 2) 
will be. hereinafter, referred to as the true MSE. The MSE as measured by summing the 
squared values of the coefficients of the deactivated taps wiU be, hereinafter, referred as 
the proxy MSE. 

There is an advantage m using the proxy MSE* instead of the true MSE. as the 
error metric. Since the proxy MSE is based solely on the coefficient values of the 
deactivated t^s, it represents only one componait of the noise signal of the gigabit 
transceiver (other components may be due to quantization noise, external noise, etc ) 

Therefore, the proxy MSE is unaffected when large external noise, other than echo or 
NEXT noise, severely affects the noise signal, hence the noise to signal ratio, of the gigabit 
transceiver. For this reason, the proxy MSE is preferred as the error metric. 

If the true MSE is used as the enor metric, then the specified error is preferably set 
at a value corresponding to a noise to signal ratio of about -22 dB, because, although 
theoretically, a tme MSE corresponding to a noise to signal ratio of-19dB is acceptable 
for the gigabit transceiver, in practice, it is difficult to obtain adequate system performance 
atthatlevel. Ifthe proxy MSE is used as the error metric, then the specified error is 
preferably set at a value corresponding to a noise to signal ratio of about -24 dB. 

Power-down block 3028 of HG. 30 is applied sequentially to tiie echo canceUer 
232 and tiie three NEXT canceUas 230 (of HG. 2). FIG. 34 iUustrates the flowchart of 
one embodiment of the powCT-down block 3028. 

Referring to FIG. 34, upon start (block 3402), the process 3028 sets die filter 
number to zero (block 3404) to operate on the echo canceller first The filter number zm) 
represents the echo canceUer. whUe filter numbers 1 tiirough 3 represent tiie fliree NEXT 
canceUers, respectively. Process 3028 tiien sets flie address equal to tiie lengtfi of tfie filter 
mmus 1 and tiie end equal to zero (block 3406). This means tiiat tiie process 3028 starts 
fiom tfie highest ordered end of tiie filter towards tiie lowest ordered end. 

Process 3028 determmes wheflier TapOn[addr] is 1, i.e.. whetiier tiie tap at tiie 
specified address is active (blodc 3408). If tiie tap is not active, tiien process 3028 turns 
off flie power to tiie tap (block 3410), tiien checks whetiier tiie address is equal to tfie end 
(block 3412). If flie address is not equal to tiie end, tiie address is decreased by 1 to 
consider tiie next lower ordered tap (block 3414). If tiie address has readied tiie end, flien 
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process 3028 detennines whether the filter number is 3, i.e., whether aU the 4 filters have 
been considered (block 3416). If the filter is not the last one, then filter number is 
increased by 1 so that the next filter is considered (block 3418). Otherwise, process 3028 
teiminates (block 3420). 

If TapOn[addr] is 1 (block 3408). i.e., if the tap at the specified address is active, 
then process 3028 stops scanmng the taps in the filter being considered, and checks the ' 
next filter, if any (block 3416). Process 3028 then proceeds as described above. 

The process 3000 of HG. 30 is appUed to the echo and NEXT cancellers of each of 
the 4 constituent transceivers of the gigabit transceiver 102 depicted in FIGs. 2 and 3. It is 
important to note that, if process 3000 is appUed simultaneously to the 4 constituent 
transceivers, there will be a power demand surge in the gigabit transceiver 102. In order to 
avoid such a power demand surge, process 3000 is appMed to the 4 transceivers in a time- 
staggered manner. 

In a second embodiment of the present invention, two different specified errors are 
used in order to avoid possible lunit cycle osciUations between activation and deactivation. 
The flowchart of the second embodiment is substantially similar to the one shown in nO. 
30. The second embodiment differs fiom the first embodiment by using a first specified 
error for the first test in block 3016 (FIG. 30) and a second specified error for the second 
test m block 3024. The first specified error is substantiaUy larger than the second 
specified error. The use of the two different specified errors, suflficiently distant fiom each 
other, allow the process 3000 to terminate when the computed error metric has a value 
located between the two specified errors. When just one specified error is used, as m the 
first embodiment, the computed error metric may jump back and forth around the specified 
«ror, causing flie process 3000 to osciUate between activation and deactivation. 

In a diird embodiment of die present invention, the first few taps of each filter, e.g., 
ae first 10 taps, are exempt fiom deactivation in order to avoid possible degradations of 
the system performance in the presence of jitter. The effect of jitter on these few taps is as 
follows. There is usually a large slew rate in these first few taps. Due to this slew rate, 
their numerical values could change significantfy if the sampling phase of the received' 
signal changes. In the presence of jitter, the sampling phase of the received signal can 
change dynamicaUy. ITius, if some of the first few taps were insignificant for the system 
perfi>rmance, they could become significant as the sampling phase changes. For this third 
embodiment, the flowchart of the deactivation process of block 3012 is sUghtly different 
fiom the one shown in HG. 30. The only modification to the flowchart of HG. 30 is to 
equate, in block 3006, the address to K mstead of 0, where K+1 is die number of the first 
few taps exempt fiom deactivatioru 

A fourth embodiment of the present invention uses, as flie error metric, the change 
in the true MSE mstead of the true MSE. In otiier words, flie value of {new (true MSE) - 
old (true MSE)) is computed and used as die error metric. In flie fourth embodunent, die 
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first test in block 301 6 is satisfied if the change in the true MSE is greater than a specified 
change value (e.g., a value that corresponds to a noise to signal ratio (NSR) change of 
IdB) and the power metric is smaller than the specified maximum power. The second test 
in block 3024 is satisfied if the change in the true MSE is smaller than the specified 
change value or the power metric is greater than the specified maximum power. For 
example, if the true MSE is at a value corresponding to a NSR of -25 dB before the tap 
power regulating process is spphcd^ and if the specified change value corresponds to a 
NSR change of 1 dB, then the final true MSE will be at a value corresponding a NSR of 
about -24 dB. This foiulfa embodiment can be used when there is large external noise tliat 
is otho- than echo or NEXT noise. In such a case, the true MSE is large even before the 
tap power regulation process is jqjplied. Thus, if the true MSE is used as the error metric, 
practically no taps will be deactivated, resulting in large power dissipatiorL In this 
situatiori, since the large noise is not caused by the uncancelled echo and NEXT 
impairments, a large number of taps could be deactivated without causing significant 
additional degradation to the system performance. The fourth embodiment allow these 
t^ to be deactivated in fiiis situation* 

In a fifth embodiment, all of the taps in a filter are initially activated and 
converged, instead of being activated in blocks and converged in stages as in the first 
embodiment. The flowchart of the fifth embodiment is similar to the one of the first 
embodiment shown in FIG. 30. except for the following two diflFa:ences. The first 
diflference is that, in the activation block 3006, the block of t^s is set to include all of the 
taps in the filter. The second difference is that the block 3020 is not needed. 

In each of tiie embodim^ts, there are several ways of computing the error metric. 
The error metric can be computed as a measurement of systmi performance degradation 
caused by the filter being considered, or by the four filters in the constituent transceiver 
being considered, or by all the 16 filters in the fom- constituent transceivers of the gigabit 
transceiver. 

When computed as a measurement of degradation caused by all 4 filters of the 
constituent transceiver being examined, the mot metric provides a good uidication of the 
bit error rate of that constituent transceiver. 

In the case A^diere the error metric is compirted as a measurement of degradation 
caused by all the 16 filters in the 4 constituent transceivers of the gigabit transceiver, the 
power regulation process can allow the filters in one of the 4 transceivers to have larger 
error and compensate for this error in the filters of the other 3 transceivers. For example, 
if the echd/NEXT impairments in one particular transceiver are very severe and too many 
active taps would be needed to cancel them, dien the power regulation process can allow 
the impairments to st^ severe in this transcdyer, and allocate the power resource to the 
other 3 transceiva:s instead. It is noted that, in this case, the trellis decoder 38 still 
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decodes correctly since it uses signal samples from all the four transceivers in its decoding 
scheme. 

When applied to the echo and NEXT cancellers of the gigabit transceiver, for 
typical channels, the power regulation process of the present invention results in a large 
number of taps being deactivated and the power consumption being reduced by a large 
fector. Simidation tests confirm this result. 

FIG. 3 5 illustrates an exemplary impulse response of the echo characteristic 
developed by a typical multi-pair transmission channel in response to a known impulse. 
FIG, 36 illustrates an exemplary impulse response of the near end crosstalk (NEXT) 
characteristics developed by a typical transmission channel in response to a similar known 
impulse. FIGs. 37A and 37B illustrate die results of simulation programming performed 
to evaluate tiie application of tap powa* regulation methodologies to a local constituent 
transceiver and a remote constituent transceiver coimected together through a transmission 
channel having the echo impiilse response of FIG. 35. 

During flie initial period of commimication, through a process known as Auto- 
Negotiation, the two transceivers negotiate then agree on their respective status as Master 
and Slave. FIGs. 37A and 37B show the MSE to signal ratio expressed in dB as a function 
of time, with time e^ressed in bauds, for the Master and Slave transceivers, respectively. 
Each point on the graphs in FIGs. 37A and 37B is obtained by averaging the instantaneous 
measurements taken over 10,000 symbol periods. The error metric MSE is computed 
based on the error signal 42A (in FIG. 2), i.e., the error as seen by the trellis decoder 38 
(FIG. 2). 

Refisrring to FIGs. 37A and 37B, during the time interval from 0 baud to about 1 .2 
X 10* bauds, the Master trains its own echo canceller while transmitting with an 
independent, fixed clock. During this time interval, the Slave synchronizes to the signal 
transmitted by the Master, and trains its feed-forward equalizer and its timing recovery 
block. During the time interval from about 1.2 x 10^ bauds to about 2.2 x 10* bauds, the 
Slave trains its echo canceller while transmitting. During this time interval, the Master is 
not transmitting, only receiving from the Slave, and trains its feed-forward equalizer and 
its timing recovery block to account for the delay in the charmel. By the end of this time 
interval, the Master and Slave are synchronized with each other. 

During the time interval from about 2.2 x 10* bauds to about 3.2 x 10* bauds, both 
the Master and Slave transmit and receive. During this time intmral, the Master retrains 
its edio canceller and readjust timing. From about 3.2 x 10* bauds, there is convergence 
of both Master and Slave echo cancellers. At about 3.6 x 10* bauds, the tap power 
regulating process of the present invention is q)plied to both echo cancellers, with the 
specified error, i.e., the maximum acceptable system performance degradation, set at a 
value corresponding to a NSR of -24 dB. As shown in FIGs. 37A and 37B, for both local 
and remote transceivers, the MSE increases to and stays at this specified error 
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corresponding to a NSR of -24 dB. In this example, in each constituent transceiver, the 
echo canceUer has initially 140 taps, and each of the three NEXT cancellers has initially 
100 taps. The total number of initial taps in each constituent transceiver is 440. 

FIGs. 38A and 38B are graphs of die values of the tap coefficients of die echo 
canceller as a fimction of die tap numba^, after appUcation of the tap power regulation 
process widi the specified error set at values corresponding to noise to signal ratio of -24 
dB and -26 dB, respectively. The deactivated coefficients are shown as having value zero. 

Referring to FIG. 38A, the number of taps remainmg active, after jpplication of the 
tap power regulation process with the specified error corresponding to a NSR of -24 dB, is 
22. For fliis specified error, the remaining active taps for the three NEXT cancellers is 6, 
2, and 0, respectively (not illustrated). Thus, out of a total of 440 initially active taps in 
tiie constituent transceiver, only 30 remain active after application of die process of tiie 
present invention, vAule a 5 dB margin is maintained for flie required bit error rate. 

Referring to FIG. 38B, after qjplication of the tap power regulation process witii 
die specified error corresponding to a NSR of-26 dB, the number of taps remaining active 
is 47. For fliis spedfied error, flie remaining active taps for tiie fliiee NEXT cancellers 
(not illustrated) is 6, 2, and 0, respectively. Thus, out of a total of 440 initially active taps 
in flie constituent tiansceiver, only 55 remain active after application of tiie process of tfie 
present invention, while a 7 dB margin is maintmned for the required bit error rate. 

FIGs. 38A and 38B show that tiie surviving taps occur at sparse locations. This is 
due to the strong dependence of die echaWEXT canceUers on tiie specific cable response. 
Since tfie response characteristics of any given cable making up tiie transmission channel 
are not a priori determinable, it would be impossible, in practice, to predict and statically 
allocate tiie surviving taps during tiie design of die echo and NEXT canceUers. Therefore, 
some sort of dynamic active tap identification and allocation process according to the 
invoition offers significant power reduction benefits over conventional methodologies. 

While tiie ^sterns and methods of the invention have been described mainly in 
terms of tiieir qjplicability to adaptively configuring active tap sets for high order digital 
filters, tiie dynamic power regulation mefliodology of flie present invention can also be 
applied to complete computation modules of a transceiver, in cases where the 
conqjutational power of such modules is not needed for a particular application. In fliese 
caseSi a similar metiiodology applies, i.e., evaluate a signal performance metric of a signal 
ou^ fiom a computational module against a performance threshold and, where the 
performance metric is greater tiian flie threshold, power down flie computational module. 

This additional embodiment of the invention is particularly advantageous in cases 
v^dierc flie transmission channel might be in^lemented with short (<3 meters) cable 
lengflis, resulting in the relative absence of transmission channel induced intersymbol 
interfoence (KSI). Returning momentarily to flie description of the trellis decoder circuit 
accompaiqring FIG. 3, in the absence of intersymbol interference, symbols received fiom 
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the deskew memory 37 need only be decoded by the Viterbi decoder 604, and its 
associated modules, i.e., the path metrics module 606, and the path memory module 608, 
without resorting to a decision-feedback sequence estimation approach, as discussed 
previously. In this case, the dynamic power regulation process reduces the power 
consumption of the gigabit transceiver by deactivating and bypassing the computational 
modules represented by the MDFE 602, the DFE 612 and the select logic 610. Since 
received symbols are relatively uneffected by charmel induced ISI, there is no need to 
develop ISI compensation for inconmiing signal samples prior to symbol decode, and 
therefore no need for ISI compensation circuitry, 

FIG. 39 is a simplified, semi*schematic block diagram of an exerriplary trellis 
decoder 38 as it might be implemented in the case wdiere it has been determined that there 
is substantially no channel induced intersymbol interference. Referring to FIG. 39, the 4- 
D output signal 37 fix>m the deskew memory 36 is provided directly to the Viterbi decoder 
604, as the Viterbi input In accordance with the invention, it should be noted that, in the 
absence of intersymbol interference, only a single 4-D Viterbi input is needed in contrast 
to the eight state inputs required in the fiill ISI compensation case. 

As illustrated in FIG. 39, the DFE, MDFE and decoder circuitry has been replaced 
by a series of simple delay stages and an adder circuit, with the deskew output signal (a 
signal sample) directly input to the Viterbi decoder 604. The deskew output signal sample 
is also directed through a set of three series coupled sequential delay stages 3920, 3922 
and 3924 and then to an adder circuit 3926. Signal samples are added to the negative of 
flie first tentative decision Vqf output by the path memory module 608 in tiiie adder circuit 
3926 in order to develop an error term. The error term is directed through an additional 
delay stage 3928 after vMch the OTor term 42 might be directed to an adaptive gain stage 
(34 of FIG. 2) and timing recovery circuit (222 of FIG. 2). In the exemplary embodiment 
shown in FIG. 39, the 4-D error 42 is computed as the delayed difference between the 
delayed 4-D ii^ut 37 and the 4-D output Vqf ofthe path memory module 608. The 
corresponding 4-D tentative decision 44 may be represented as nothing more than a 
delayed version of the 4-D ou^ut Vq^ of the path memory module 608; the delay occuring 
in an additional del^ stage 3930. In the embodiment shown in FIG. 39, flie error and 
tentative decision delay elements 3928 and 3930, respectively, are used to ensure that the 
error 42 and the tentative decision 44 arrive at the timing recovery block (222 of FIG. 2) at 
the same time; Depending on the design and implementation of the timing recovery bloc^, 
flxese delay elements may not necessarily be needed in alternative embodiments. 

FIG. 40 illustrates yet a fiutiier embodimrat of the invention which is particularly 
advantageous in situations where the signal-to-noise ratio is very high (as may happen 
with a short cable, e.g., of less than 50 meters). In such situations, the coding gain 
provided by the trellis code may not be needed, and adequate system performance, as 
indicated by the bit error rate, may be adiieved \\ithout making use of this coding gain. In 
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these situations, substantial power dissipation reductions can be achieved by disabling the 
trellis decode enabling features of the complex Viterbi decoder, including the Viterbi 
decoder block 604. its associated path metric and path memory modules 606 and 608. and 
a large portion of the ISI compensation circuitry including the MDFE 302 and the select 
logic 610. These portions are replaced, or substituted, with a simple symbol-by-symbol 
decoder and a simple decision feedback equaliEer to detect the received signal, instead of 
using the computationally complex Viterbi decoder. 

Refening to FIG. 40, signal samples output by the deskew memoiy are directed 
through an adder circuit 4032, which determines the difference between the input signal 
samples and the 4-D output of a DFE 4034. A symbol-by-symbol decoder 4036 receives 
the difference between the 4-D signal samples and the 4-D output fiom the DFE 4034 and 
decodes it A 4-D tentative decision 44 is taken directly from the output of the symbol-by- 
symbol decoder 4036, and an error term 42 is developed by an additional adder circuit 
4038, coupled to define the difference between the input and the ou^ut of the symbol-by- 
symbol decoder 4036. A soft decision 43, which is used for display purposes only, is 
taken directly fiom the input of the symbol-by-symbol decoder 4036. 

Final decisions are developed by delaying the output of the symbol-by-symbol 
decoder through three series coupled sequential delay stages 4040, 4042 and 4044. The 
output of each respective delay stage is duected to the DFE as a conespondmg tentative 
decision VoF, V,p and Vjj:. 

In each of the cases described in connection with FIGs. 39 and 40, it will be 
understood that the surviving elements of the decoder section are all present in a folly 
functional Viterbi decoder system with ISI compensation. Such a system is described in 
co-pending U.S. patent application entitled System and Method for High-Speed Decoding 
and ISI Compensation in a Multi-Pair Transceiver System, filed on instant date herewith 
and commonly owned by the assignee of the present invention, the entire contents of 
which are ejqjressly incorporated by reference. As decisions are made with regard to the 
desirabiUty of mamtaining the circuitry in a fiilly operational condition or truncating 
certam computational sections in an effort to reduce power dissipation, the system need 
only remove power fiom certain identified portions of the circuitry, with other identified 
portions aUowed to remain powered-up in the active signal path. No additional 
componott circuit elements need be provided. 

The dynamic power regulation methodology of the present invention can also be 
appUed to any other component module of a communication system, so long as that 
module is able to provide a given mimmal level of performance with a truncated 
fimctional representation or wifli truncated circuitry. Of course, such minimal 
performance levels will obtain in only certam situations and are dependent on external 
fectors, particularly die transmission diannel diaracteristics. However, these situations 
fi»quently appeal in a substantial number of plications or installations. An mtegrated 
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circuit transceiver capable of adaptively configuring itself to provide a * just sufiBcient" 
level of performance while operating at the lowest obtainable power dissipation levels 
would lend itself to almost universal application. 

The present invention further provides a method and a timing recovery system for 
generating a set of clock signals in a processing system. The set of clock signals includes 
a set of sampling clodc signals. The processing system includes a set of processing 
subsystems, each of which includes an analog section. Each of the analog sections 
operates in accordance with a corresponding sampling clock signals. An example of the 
processing system is a gigabit transceiver. In this case, the processing subsystems are the 
constituent transceivers. 

Tlie present invention can be used to generate and distribute clock signals in a 
gigabit transceiver of a Gigabit Ethernet communication system such that effect of 
switching noise coupled from one clock domain to another clock domain is minimized. 
By "clock domain", it is meant the circuit blocks that are operating according to transitions 
of a particular clock signal. For ease of explanation, the present invention will be 
described in detail as applied to this exemplary application. However, this is not to be 
construed as a limitation of the present invention. 

In order to appreciate the advantages of the present invention, it will be beneficial 
to describe the invention in the context of an exemplary bi-directional commimication 
device, such as an Ethernet transceiver. The particular exemplary implementation chosen 
is depicted in FIG. 1, which is a simplified block diagram of a multi-pair communication 
system operating in conformance with the ffiEE 802.3ab standard (also termed 
lOOOBASE-T) for 1 gigabit (Gb/s) Ethernet fiiU-di^lex coromunication over four twisted 
pairs of Category-5 copper wires. 

In FIG. 1, the coimnunication system is represented as a point-to-point system in 
order to simplify the e^qilanation, and includes two main transceiver blocks 102 and 104, 
coupled togeflier via four twisted-pair cables 1 12a, b, c and d. Each of the wire pairs 1 12a, 
b, c, d is coupled to each of the transceiver blocks 102, 104 through a respective one of 
four line mterfece circuits 106. Each of the wire pairs 112a, b, c, d facilitates 
conomunication of information between corresponding pairs of four pairs of 
transmitt^receiver circuits (constituoit transceivers) 108. Each of the constituent 
transceivers 108 is coiqiled between a respective line interface circuit 106 and a Physical 
Coding Sublaya- (PCS) blodc 1 10. At each of the transceiver blocks 102 and 104, the four 
constituoit transceivers 108 are capable of operating simultaneously at 250 megabits of 
information data per second (Mb/s) each, and are coupled to the corresponding remote 
constitu^t transceivers through respective line int^ace circuits to facilitate full-duplex 
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bi-directional operation. Thus, IGb/s communication throughput of each of the 
transceiver blocks 102 and 104 is achieved by using four 250 Mh/s (125 Mbaud at 2 
information data bits per symbol) constituent transceivers 108 for each of the transceiver 
blocks 102, 104 and four pairs of twisted copper cables to connect flie two transceiver 
blocks 102, 104 together. 

The exemplary communication system of FIG. 1 has a superficial resemblance to a 
100BASE-T4 system, but is configured to operate at ten times the bit rate. As such, it 
should be understood that certain system performance characteristics, such as sampling 
rates and the like, will be consequently higher and cause a greater degree of power 
consunqition. Also, at gigabit data rates over potentially noisy channels, a proportionately 
greater degree of signal processing is required m many instances to insure an adequate 
degree of signal fidelity and quahty. 

FIG. 2 is a snnplified block diagram of the functional architecture and internal 
constouction of an exemplary transceiver block, indicated generaUy at 200, such as 
transceiver 102 of FIG. 1. Since flie illustrative transceiver appKcation relates to gigabit 
Ethernet hansmission, the transceiver will be referred to as the "gigabit transceiver". For 
ease of illustration and description. FIG. 2 shows only one of the four 250 Mb/s 
constihient transceivers which are operating simultaneously (termed herein 4-D operation). 
However, since the operation of the four constituent ti^ceivers are necessarily 
interrelated, certain blocks and signal lines in the exemplary embodiment of FIG. 2 
perform four-dimoisional operations and carry four-dimensional (4-D) signals, 
respectively. By 4-D. it is meant that flie data from flie four constiturat hansceivers are 
used simultaneously. In order to clarify signal relationships in FIG. 2, fliin lines 
correspond to 1-dimensional functions or signals (i.e., relating to only a single constituent 

transceiver), and tiiick lines conespond to 4-D fimctions or signals (rehiting to all four 
constituent transcdvos). 

Referring to FIG. 2, flie gigabit transceiver 200 includes a Gigabit Medium 
Indqiendent Interfece (GMII) block 202 subdivided into a receive GMH dicuit 202R and 
a tiansmit GME circuit 202T. The transceiver ako includes a Physical Coding Sublayer 
(PCS) block 204, subdivided into a receive PCS circuit 204R and a transmit PCS circuit 
204T. a pulse shying filter 206, a digital-to analog (D/A) converter block 208, and a line 
interfece block 210, aU generaUy encompassing flie transmitter portion of flie transceiver. 
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The receiver portion generaUy includes a highpass filter 212, a programmable gain 
amplifier (PGA) 214, an analog-to-digital (A/D) converter 216. an automatic gain control 
(AGC) block 220, a timing recovery block 222. a pair-svi'ap multiplexer block 224, a 
demodulator 226, an oflfeet canceUer 228, a near-end crosstalk (NEXT) canceller block 
230 having three constituent NEXT cancellers and an echo canceller 232. 

The gigabit transceiver 200 also includes an A/D first-in-first-out buffer (FIFO) 
218 to fecilitate proper transfer of data fiom the analog clock region to the receive clock 
region, and a loopback FIFO block (LPBK) 234 to facUitate proper transfer of data fiom 
tiie transmit clock region to the receive clock region. The gigabit transceiver 200 can 
optionally include an additional adaptive filtw to cancel far-end crosstalk noise (FEXT 
canceller). 

In operational tains, on the transmit path, tiie transmit section 202T of the GMU 
block receives data fiom tiie Media Access Control (MAC) module in byte-wide fi)rmat at 
the rate of 125 MHz and passes them to the transmit section 204T of the PCS block via the 
FIFO 201 . The FIFO 201 eaisures proper data transfer from the MAC layer to the Physical 
Coding (PHY) layer, since the transmit clock of the PHY layer is not necessarily 
synchronized with the clock of the MAC layer. In one embodiment, this smaU FIFO 201 
has fixjm about three to about five memory cells to accommodate the elasticity 
requirement whidi is a fimction of fi:ame size and frequency o£&et. 

The PCS transmit section 204T perfonns certain scrambling operations and, in 
particular, is responsible for encoding digital data into the requisite codeword 
rqiresentations ^ropriate for transmission. In the illusb^ed embodiment of FIG. 2, the 
tiansmit PCiS section 204T incorporates a coding engine and signal mapper that 
in^lanents a trellis codmg architecture, such as reqmred by the IEEE 802,3ab 
specification for gigabit transmission. 

In accordance wifli this encoding architecture, the PCS transmit section 204T 
generates four l-D symbols, one for each of the four constituent transceivers. The 1-D 
symbol generated for the constituent transceiver depicted in HG. 2 is filtered by die pulse 
shying filta: 206. This filt«ing assists in reducing tiie radiated emission of tiie ou^ut of 
the transceiver such that it fells wifliin tiie parameters required by the Federal 
Communications Commission. The pulse shaping filter 206 is unplemented so as to 
define a transfer fimction of 0.75 ■H).25r'. TTiis particular implementation is chosen so 
that tfaepower spectnnn of the output of the transceiver fells below the power spectrum of 
a 100Base-TX signal. The lOOBase-Tx is a widely used and accepted Fast Ethernet 
standard for 100 Mb/s operation on two pairs of Category-5 twisted pair cables. The 
ou^ut of the pulse shying filter 206 is converted to an analog signal by the D/A converter 
208 operating at 125 MHz. The analog signal passes through the line interfece blodc 210, 
and is placed on the corresponding twisted pair cable. 
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On the receive pafli, the line intafece block 210 receives an analog signal fiom the 
twisted pair cable. The received analog signal is preconditioned by flie hi^ass filter 2 1 2 
and flie PGA 214 before bemg converted to a digital signal by the A/D converter 216 
operating at a sampling rate of 125 MHz. The timing of the A/D converter 216 is 
controUed by the ou^ut of the timing recovery block 222. The itsulting digital signal is 
properly transferred fiom die analog clock region to the receive clock region by the A/D 
FIFO 218. The output of the A/D FIFO 218 is also used by the AGC 220 to control the 
operation of the PGA 214. 

The output of the A/D FIFO 218, along with the outputs fipm the A/D FIFOs of 
the oflia- three constituent transceivers are inputted to the pair-swjqi multiplexer block 
224. The pair-swap multiplexer block 224 uses the 4-D pair-sw^ control signal fiom the 
receive section 204R of PCS block to sort out the four mput signals and send the correct 
signals to the respective feedforward equahzers 26 of the demodulator 226. This pair- 
swiping control is needed for the foUowing reason. The treUis coding mefliodology used 
for the gigabit transceivers (102 and 104 of FIG. 1) is based on the feet that a signal on 
each twisted pair of wire corresponds to a respective 1-D constellation, and that flie signals 
transmitted ova- four twisted pairs collectively form a 4-D constellation. Thus, for tfie 
decoding to work, each of the four twisted pairs must be uniquely identified wifli one of 
flie four dimensions. Aixy undetected swapping of flie four pairs would result in erroneous 
decoding. In an alternate embodimoit of the gigabit transceiver, flie pair-swapping control 
is performed by the demodulator 226, instead of the combination of the PCS receive 
section 204R and the pair-swap multiplexer block 224. 

The donodulator 226 includes a feed-forward equalizer (FFE) 26 for each 
constituent transceivw, cpipled to a deskew memory circuit 36 and a decoder circuit 38, 
in^lemented in flie illustrated embodiment as a treUis decodrar. The deskew memory 
drcuit 36 and flie trellis decoder 38 are conunon to all four constituent transceivers. The 
FFE 26 receives flie received signal intoided for it fiom tiie pair-swap multiplexer block 
224. The FFE 26 is suitably in^lonented to include a precursor filter 28, a programmable 
inverse partial response (IPR) filter 30. a summing device 32. and an adaptive gain stage 
34. The FFE 26 is a least-mean-squares (LMS) type a4q)tive filter which is configured to 
perform channel equalization as will be described in greater detail below. 

The precursor filter 28 gaierates a precursor to flie input signal 2. This precursor is 
used for timing recovoy. The transfer fimction of ttie precursor filter 28 mi^t be 
represented as -g +2"', wifli g equal to 1/16 for short cables (less flian 80 meters) and 1/8 
fin- long cables (more flian 80 m). The determination of flie lengtii of a cable is based on 
the gain of flie coarse PGA 14 of flie programmable gain block 214. 
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The programmable IPR filter 30 conqjensates the 1SI (intensymbol interference) 
introduced by the partial response pulse shying in the transmitter section of a remote 
transceiver which transmitted the analog equivalent of the digital signal 2. The transfer 

5 functionoftheIPRfilter30maybeexpressedas 1/(1+K2-'). In the present example. K 
has an exemplary value of 0.484375 during startup, and is slowly ramped down to zJro 
after convergence of the decision feedback equalizer included inside the trellis decoder 38^ 
The value of K may also be any positive value stricUy less than 1. 

The summing device 32 receives tile output of the IPR filter 30 and subtracts 

iO &erefi»m ad^tively derived canceUation signals received fiom the adaptive filter block, 
namely signals developed by the ofl&et canceller 228, the NEXT cancellers 230, and the 
echo canceUer 232. The ofl&et canceUer 228 is an adaptive filter which generat^ an 
estimate of signal oflfeet introduced by component circuitry of the transceiver's analog 

fiont end, particularly offeets introduced by the PGA 214 and the A/D converter 216. 
15 "^^^^ NEXT cancellers 230 may also be described as adaptive filtere and are 

used, in the illustrated embodiment, for modeling the NEXT impairments in the received 
signal caused by interference generated by symbols sent by the three local tiansmittera of 
«he other three constituent transceivers. These impairments are recognized as being 

caused by a crosstalk mechanism between neighboring pairs of cables, thus the term near- 
20 end crosstalk, or NEXT. Since each receive has access to the data transmitted by the 
other three local transmitters, it is possible to approximately replicate the NEXT 
impairmaits through filtering. Referring to HG. 2, the three NEXT canceUers 230 filter 
the signals sent by flie PCS block to the other three local transmitters and produce three 
signals rq)Ucating flie respective NEXT impahments. By subtracting these three signals 
25 fiom the output of flie IPR filter 30, the NEXT impairments are approximately cancelled. 

Due to the bi-directional nature of the chaimel, each local transmitter causes an 
echo impairment on the received signal of the local receiver with which it is pahed to form 
a constihient transceiver. In order to remove this impamnent, an echo canceller 232 is 
provided, which may also be characterized as an adaptive filter, and is used, m the 
30 iUustrated embodiment, for modeling the signal impairment due to echo. The echo 

canceUer 232 filters the signal sent by the PCS block to the local transmitter associated 
with the receiver, and produces an approximate rephca of the echo impaitmenL By 
subtracting this repUca signal fiom the output of the IPR filter 30, the echo inqjairment is 
qjproximately cancelled. 
35 The adaptive gain stage 34 receives the processed signal fiom the summing circuit 

32 and fine ftmes the signal path gain using a zero-forcing LMS algorithm. Since this 
a<i^tive gain stage 34 trains on the basis of error signals generated by Uie adaptive filters 
228, 230 and 232. it provides a more accurate signal gain than the one provided by the 
PGA 214 in the analog sectioiL 

The ou^ut of the adq)tive gain stage 34, which is also the output of the FFE 26, is 
inputted to the deskew memory circuit 36. The deskew memory 36 is a four-dimensional 
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function block, i.e., it also receives the outputs of the three FFEs of the other three constituent 
transceivers. There may be a relative skew in the outputs of the four FFEs, which are the four 
signal samples representing the four symbols to be decoded This relative skew can be up to 
50 nanoseconds, and is due to the variations in the way the copper wire pairs are twisted In 
order to correctly decode the four symbols, the four signal samples must be properly aUgned. 
The deskew memory aUgns the four signal samples received fiom the four FFEs. then passes 
the deskewed four signal samples to a decoder circuit 38 for decoding. 

In the context of the exemplary embodiment, flie data received at the local 
transceiver was encoded before transmission, at the remote transceiver. In the present 
case, data might be encoded using an 8-state four-dimensional trclUs code, and the decoder 
38 might therefore be in^jlemented as a trellis decoder. In the absence of intersymbol 
interference (IST), a proper 8-state Viteibi decodo" would provide optimal decoding of this 
code. However, m flie case of Gigabit Ethernet, the Category-5 twisted pair cable 
introduces a significant amount of ISL In addition, the partial response filter of the remote 
transmitter on the other end of the communication channel abo contributes some ISL 
Therefore, the trellis decoder 38 must decode both the treUis code and the ISI, at the high 
rate of 125 MHz. In the iUustrated embodhnent of the gigabit transceiver, the trellis 
decoder 38 includes an 8-statc Viterbi decoder, and uses a decision-feedback sequence 
estimation approach to deal witii the ISI components. 

The 4-D output of the trellis decoder 38 is provided to the PCS receive section 
204R. The receive section 204R of the PCS block de-scrambles and decodes the symbol 
stream, flien passes the decoded packets and idle stix;am to the receive section 202T of the 
GMH block which passes them to the MAC module. The 4-D outputs, which are the error 
and tentative decision, respectively, are provided to the timing recovery block 222, whose 
output controls the sampUng time of the A/D converter 216. One of the four components 
of the error and one of tiie four components of the tentative decision correspond to the 
receivw shown m FIG. 2. and are provided to flie ad^tive gain stage 34 of ttie FFE 26 to 
adjust the gain of flie equalizer signal patii. The error component portion of flie decoder 
output signal is also provided, as a control signal, to adj5)tation circuitry incorporated in 
each of flie ad^tive filters 230 and 232. Adq)tation circuitry is used for flie updating and 
training process of filta- coefficiaits. 

For flie exemplary gigabit transceiver system 200 described above and shown in 
FIG. 2, fliae is a PHY Control system (not shown) vMch provides control signals to tfie 
blocks shown in FIG. 2, includmg flie timing recovery block 222, to control flieir 
fimctions. 

For flie exemplary gigabit transceiver system 200 described above and shown in 
FIG. 2, flierc are design considerations regarding flie aUocation of boundaries of flie clock 
domains. These design considerations are dependent on flie clocking relationship between 
transmitters and receivers in a gigabit transceiver. Therefore, fliis clocking relationship 
will be discussed first 
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During a bidirectional communication between two gigabit transceivers 102, 104 
(FIG. 1), through a process called "auto-negotiation", one of the gigabit transceivers 
assumes the role of the master while the otho- assumes the role of the slave. When a 
^gabit transceiver assumes one of the two roles with respect to the remote gigabit 
transceiver, each of its constituent transceivers assumes the same role with respect to the 
corresponding one of the remote constituent transceivers. Each constituent transceiver 108 
is constructed such that it can be dynamically configured to act as either the master or the 
slave with respect to a remote constituent transceiver 108 during a bidirectional 
commimication. The clocking relationship between the transmitter and receiver inside the 
constituent transceiver 108 depends on the role of the constituent transceiver (i.c., master 
or slave) and is different for each of the two cases. 

FIG. 19 illustrates the geaieral clocking relationship on the conceptual level 
between the transmitter and the receiver of the gigabit Ethernet transceiver (102 or 104) of 
FIG. L For this conceptual FIG. 19, the transmiitter TX represents the four constituent 
transmitters and the receiver RX represents the four constituent receivers. 

Referring to FIG. 19, the gigabit transceiver 1901 acts as the master while the 
gigabit transceiver 1902 acts as the slave. The master 1901 includes a transmitter 1910 
and a receiver 1912, The slave 1902 includes a transmitter 1920 and a receiver 1922. Hie 
transceivo- 1901 (respectively, 1902) receives &om the GMII 202T (FIG. 2) the data to be 
transmitted TXD via its input 1913 (r^ectively, 1923), and the GMII transmit clock 
GTX^CLK (tills clock is also called "gigabit transmit clock" in tiie IEEE 802.3ab 
standard) via its input 1915 (respectively, 1925). The transceiver 1901 (respectively, 1902) 
sends to the GMH 202R (FIG. 2) the received data RXD via its output 1917 (respectively, 
1927), and Hxe GMII receive clock RX_CLK: (this clock is also called "gigabit receive 
clock" in the IEEE 802.3ab standard) via its output 1919 (respectively, 1929). It is noted 
that the clocks GTX_CLK and RX_CLK may be different from the transmit clock TCLK 
and receive clock RCLK, respectively, of a gigabit transceiver. 

The receiver 1922 of the slave 1902 synchronizes its receive clock to the transmit 
clock of the transmitter 1910 of the master 1901 in ozd^ to properly receive the data 
transmitted by the transmitter 1910. The transmit clock of the transmitter 1920 of the 
slave 1902 is essentially the same as the receive clock of the receiver 1922, thus it is also 
synchronized to the transmit clock of the transmitter 1910 of the master 1901. 

The receive: 1912 of the mast^ 1901 is synchronized to the transmit clock of the 
transmitter 1920 of the slave 1902 in order to properly receive data sent by the transmitter 
1920. Because of the synchronization of the receive and transmit clocks of the slave 1902 
to flie transmit clock of transmitter 1910 of the master 1901, the receive clock of the 
receivo" 1912 is synchronized to the transmit clock of the transmitter 1910 with a phase 
delay (due to the twisted pairs of cables). Thus, in the absence of jitter, afta- 
syndnonization, the receive clock of receiver 1912 tracks flie transmit clodc of transmitter 
1910 witti a phase delay. In other words, in principle, the receive clock of receiver 1912 
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has the same frequency as the transmit clock of transmitter 1910, but with a fixed phase 
delay. 

However, in the presence of jitter or a change in the cable response, these two 
clocks may have different instantaneous frequencies (frequency is derivative of phase with 
respect to time). This is due to the fact fliat, at the master 1901, the receiver 1912 needs to 
dynamically change the relative phase of its receive clock with respect to the transmit 
clock of transmitter 1910 in order to track jitter in the incoming signal from the transmitter 
1920 or to compensate for the change in cable response. Thus, in practice, the transmit 
and receive clocks of the mast^ 1901 may be actually independent. At the master, this 
independoice creates an asynchronous boundary between the transmit clock domain and 
the receive clock dpmairL By "transmit clock domain", it is meant the region where cirxniit 
blocks are op^ted in accordance with transitions in the transmit clock signal TCLK. By 
"receive clock domain", it is meant the region where circuit blocks are operated in 
accondance with transitions in the receive clock signal RCLK. In order to avoid any loss 
of data when data cross the asynchronous boundary between the transmit clock domain 
and the receive clock domain inside the master 1901, FIFOs are used at this asynchronous 
boundary. For the exemplary structure of the gigabit transceiver shown in FIG. 2, FIFOs 
234 (FIG, 2) are placed at this asynchronous boundary. Since a constituent transceiver 
108 (FIG. 1) is constructed such that it can be configured as a master or a slave, the FIFOs 
234 (FIG. 2) are also included in the slave 1902 (FIG. 19). 

At the slave 1902, the transmit clock TCLK of transmitter 1920 is phase locked to 
the receive clock RCLK of receiver 1922. Thus, TCLK may be diflFerent from 
GTX_CLK, a FIFO 1930 is needed for proper transfCT of data TXD 6om the MAC (not 
shown) to the transmitter 1920. The depth of the FIFO 1930 must be sufficient to absorb 
any loss during the length of a data packet Hie multiplexer 1932 allows to use either the 
GTX^CLK or the receive clock RCLK of receiver 1922 as the signal RX_CLK 1929. 
When the GTX_CLK is used as the RX_CLK 1929, the FIFO 1934 is needed to ensure 
proper transfer of data RXD 1927 bom the receiver 1922 to the MAC. 

For the concqjtual block diagram of FIG. 19, there are one transmit clock TCLK 
and one receive clock RCLK for a gigabit transceiver. The transmit clock TCLK is 
common to all four constituent transceivers since data transmitted simultaneously on all 
four twisted pairs of cable correspond to 4D symbols. Since data received from the four 
twisted pairs of cable are to be decoded simultaneously into 4D symbols, it is an efficient 
design to have all flie digital processing blocks clo<^ed by one clock signal RCLK. 
HowevQ-, due the dififerent cable responses of the four twisted pairs of cable, the A/D 
converter 216 (FIG. 2) of each of the four constituOTt transceivers requires a distinct 
sampling clock signal. Thus, in addition to the signals TCLK and RCLK, the gigabit 
transceiver system 200 requires four sampling clock signals. 

ThCTc is an alternative structure for the gigabit transceiver ^ere the partition of 
clock domains is difieroit than the one shown in FIG. 2. This alternative structure (not 
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shown explidtty) is similar to the one shown in FIG. 2 and only diflfers in that its transmit 
clock domain includes both the transmit clock domain and the receive clock domain of 
FIG. 2, and that flie FIFO block 234 is not needed. In other words, in this alternative 
structure, the receive clock RCLK is the same as the transmit clock TCLK, and the 
tmnsmit clock TCLK is used to clock both the transmitter and most of the receiver. The 
advantage of this alternative structure is that there is no asynchronous boundary between 
the transmit region and most of the receive region, thus allowing the echo canceller 232 
and NEXT cancellers 230 to work with only one clock signal. The disadvantage of this 
alternative structure is that there is a potential for a perfiarmance penalty at the master 
when the constituent transceivers are tracking jitter. As a result of tracking jitter, the 
relative phase of a sampling clock signal with respect to the transmit clock TCLK may 
vary dynamically. This could cause the A/D converter to ssmph at noisy instants \ft*ere 
transistors in circuit blocks operating according to the clock signal TCLK are switching. 
Thus, the alternative structure is not as good as the stmcture shown in FIG. 2, with respect 
to the switching noise problem. 

HG. 20 is a sunplified block diagram of an embodiment of the timing recovery 
syst^ constructed according to flie present invention and ^plied to the gigabit 
transceiver architecture of FIG. 2. The tuning recovery system 222 (FIGS. 2 and 3) 
generates the difiGsrmt clo<^ signals for the exemplary gigabit transceiver shown in FIG. 2, 
namely, the sampling clock signals ACLKO, ACLKl, ACLK2, ACLK3, the receive clock 
signal RCLK, and the transmit clock signal TCLK. 

The timing recovery system 222 includes a set of phase detectors 2002, 2012, 
2022, 2032, a set of loop filters 2006, 2016, 2026, 2036, a set of numericaUy controUed 
oscillators (NCO) 2008, 2018, 2028, 2038 and a set of phase selectors 2010, 2020, 2030, 
2040, 2050, 2060. The adders 2004, 2014, 2024, 2034 are shown for conceptual 
illustration purpose only. In practice, these adders are implemented within the respective 
phase detectors 2002, 2012, 2022, 2032. The RCLK Offset is used to adjust the phase of 
the necdve clock signal RCLK in order to reduce the eflfects of switching noise on the 
san^ling operations of flxe corresponding A/D converters 216 (FIG. 2). Three of the four 
signals ACLKO Ofl&et, ACLKl Ofl&et, ACLK2 Offeet, ACLK3 Ofiset arc used to sUghtly 
adjust flie phases of flie respective san:q)ling clocks ACLKO through ACLK4 in order to 
fiffther reduce these effects of switdiing noise. The phase adjustments of the receive clock 
RCLK and ttie isampling clocks ACLKO - 3 arc not anecessary fimction of the timing 
TGcovery system 222. However, the method and system for genaating these phase 
at^ustment signals constitute another novel aspect of the present invention and will be 
described in detail later. 

Eadi of the phase d^ectors 2002, 2012, 2022, 2032 receives the corresponding ID 
component of flie 4D slicer error 42 (FIGS. 2 and 3) and the corresponding ID component 
of the 4D tentative decision 44 (FIGS. 2 and 3) fiom the decode 38 (FIG. 2) to generate a 
corresponding phase mor. The phase errors 0 through 3 are inputted to the loop filters 
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2006, 2016, 2026, 2036, respectively. The loop filters 2006, 2016, 2026, 2036 generate 
and output filtered phase enors to the NCOs 2008, 2018, 2028, 2038. The loop filters 
2006, 2016, 2026, 2036 can be of any ordar. In one embodiment, the loop filters are of 
second order. The NCOs 2008. 2018. 2028, 2038 generate phase control signals fiom the 
filtoed phase errors. The phase selectors 2010, 2020, 2030, 2040 receive corresponding 
phase control signals firom the NCOs 2008, 201 8, 2028, 2038, respectively. Each of the 
phase selectors 2010, 2020, 2030, 2040 selects one out of several phases of the multi- 
phase signal 2070 based on ttie value of the corresponding phase control signal, and 
ou^uts the corresponding sampling clock signal. In one embodiment of the invention, the 
multi-phase signal has 64 phases. 

The multi-phase signal 2070 is generated by a clock generator 2080. In the 
ratemplary onbodimrait illustrated in FIG. 20, ttie clock generator 2080 includes a crystal 
oscillator 2082, a fi^quency multiplier 2084 and an 8-phase ring oscillator 2086. The 
crystal osdilator 2082 produces a 25 MHz clock signal. The fi«quency multiplier 2084 
multiplies the fiequoicy of the 25 MHz clock signal by 40 and produces a 1 GHz clock 
signal. From tiie 1 GHz clock signal, the 8-phase ring oscillator 586 produces the 8 GHz 
64-phase signal 2070. 

The receive clock signal RCOC, which is used to clock all the circuit blocks in the 
receive clock domain {wiudi include all the digital signal processing circuit blocks in FIG. 
2), can be generated independentiy of the san^ling clock signals ACLKO throu^ 
ACLK3. HowevCT, for design efficiency, RCLK is chosrai to be related to one of the 
sampling clock signals ACLKO through ACLK3. For the exemphuy embodiment 
illustrated in FIG. 20, the receive clock signal RCLK is related to the sampling clock 
signal ACLKO. The receive clock signal RCLK is generated by inputting tiie sum of the 
phase contix)l signal ou^utted fix)m the NCO 2008 and the RCLK Ofl&et via an adder 
2042 to the phase selector 2050. Based on this sum, the phase selector 2050 selects one of 
the 64 phases of flie multiphase signal 2070 and outputs flie receive clock signal RCLK. 
Thus, whai the RCLK Ofl&et is zero, flie receive clock signal RCLK is the same as the 
san^Iing dock ACLKO. 

As discussed previously in relation to FIG. 19, when tiie constitiiait transceiver is 
configured as flie master, its transmit clock TCLK is practically independent of its receive 
clock RCLK. In FIG. 20, whm flie constituent transceiver is the master, the transmit clock 
signal TCLK is genaated by inputting flie signal TCLK O&et, generated by flie PHY 
Control system of the gigabit ti^ccivcr, to flic phase selector 2060. Based on the TCLK 
OSkei, flie phase selector 2060 selects one of flie 64 phases of tfie multi-phase signal 2070 
and produces flie tiansmit clock signal TCLK Whaitfie constituent transceiver is flie 
slave, the transmit clock signal TCLK is generated by inputting flic sum of flie output of 
flie NCO 2008 and flie signal TCLK Offset, via tiie adder 2042, to flie phase selector 2060. 
Based on fliis sum, flie phase selector 2060 selects one of flie 64 phases of flie multi-phase 
signal 2070 and produces flie tirasmit clock signal TCLK. Thus, at flie slave, tiie transmit 
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clock signal TCLK and the receive clock signal RCLK are phase-locked (as discussed 
previously in relation to FIG. 19). 

It is inqjortant to note that, referring to FIG. 20, the fiinction performed by the 
combination of an NCO (2008, 2018, 2028, 2038) foUowed by a phase selector (2110, 
2120, 2130, 2140, 2150, 2160) can be inqilemented by analog circuitry. The analog 
circuitiy can be described as follows. Each of the filtered phase errors outputted fiom the 
loop filters (2006, 2016, 2026, 2036) would be inputted to a D/A converter to be converted 
to analog form. Each of the analog filtered phase errors would them be inputted to a 
voltage^ntroUed oscillator (VCO). The VCOs would produce the clock signals. The 
VCOs can be implemented with well-known analog techniques such as those using 
varactor diodes. 

FIG. 21 is a block diagram illustrating a detailed implementation of the phase 
detectors 2002. 2012, 2022, 2032. the loop filters 2006, 2016, 2026, 2036, and the NCOs 
2008, 2018, 2028, 2038 of FIG. 20. 

It is in^oitant to note that the 4D path connecting the phase detectors 2002, 2012, 
2022, 2032, the loop filters 2006, 2016, 2026, 2036. the NCOs 2008. 2018. 2028, 2038 
and the phase selectors 2010, 2020. 2030, 2040 (FIG. 20) can be thought of as the 4D 
forward path of a phase locked loop w^ose 4D feedback path goes fiom, referring now to 
FIG. 2, the AID converters 216 to the demodulator 226 then back to the timing recovray 
222. Hie input to this phase locked loop is actually phase information embedded in the 
slicer axat A2 and tentative decision 44, and the phase locked loop output is the phases of 
the sampling clock signals. This phase locked loop is digital but can be approximated by a 
continuous-time phase locked loop for practical design analysis purpose, as long as the 
sanq>ling rate is much larger than the bandwidth of the loop. The theoretical transfer 
Amotion of a continuous-time second-order phase locked loop is: 

^j)_ K^s + K^-K^ 
where the transfer function of flie loop filter is: 



where is the gain of the voltage-controlled oscillator, is the gain of the phase 
detector, K^=K^' Kj and K, is the gain of the integrator inside the loop filtCT. For the 
digital phase lodced loop of the present invaition, die gain parameters and K, can be 
computed fiom the word lengths and scale factors used in inqjleanenting the NCO and the 
integrator of the loop filter. However, the gain of the phase detector is more 
convenioiay confuted by simulation. The gain parameters are used for the design and 
analysis of the digital phase lodced loop. 
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FIG. 21 shows a phase detector 21 10, a first filter 2130, a second filter 2150, an 
adder 2160 and an NCO 2170. The phase detector 21 10 is an exemplary embodiment of 
the phase detectors 2002, 2012, 2022, 2032 of nO. 20. The combination of the first filter 
2130, the second filto- 2150 and the adder 2160 is an exemplary embodiment of the loop 
filters 2006, 2016, 2026, 2036 of Fia 20. The NCO 2170 is an exemplary embodunent of 
the NCOs 2008, 2018, 2028, 2038 of FIG. 20. 

In FIGS. 21 througji 23, the numbers in the form "Sn.k" indicate the format of the 
data, whsre S denotes a signed number, "n** denotes the total number of bits and "k" 
d^otes the numb^ of bits after the decimal point 

The phase detector 21 10 includes a lattice structure having two delay elements 
21 12, 21 18, two multipliers 21 14, 2120 and an adder 2122. The phase detector 21 10 
receives as hiputs the corresponding ID component of the 4D sheer error 42 (HGS. 2 and 
3) and the corresponding ID componmt of the 4D tcaitative decision 44 (FIGS. 2 and 3) 
fiom the trellis decoder 38 (FIGS. 2 and 3). Forsimphcity,inFIG. 21, thesetwo ID 
components are labeled as 42A and 44A, respectively. It is understood that, for the phase 
detector of each of flie four constituent transceivers of the gigabit transceiver, a distinct ID 
component of the sheer error 42 and a distinct ID conqjonent of the tentative decision 44 
are used as ixiputs. On flie vppcr branch of ttie lattice structure, the sheer error 42 is 
delayed by one unit of time (here, one symbol period) via the delay element 21 12, then 
multipUed by the tentative decisfion 44A to produce a pre-cursor phase error 21 i 5. The 
pre-<:ursor phase error 21 15, vyhen accumulated over time, i^resents the correlation 
between a past sheer error and a present tentative decision, thus indicates the sampling 
phase error with respect to the zero-crossing point at the start of tiie signal pulse (this zero- 
crossing point is part of the pre-cursor introduced by design to the signal pulse by the 
precursor filter 28 of tiie FFE 26 in FIG. 2). On tiie lower branch of flie lattice structure, 
the tOTtative decision 44A is delayed by one unit of time via tiie delay element 2118, then 
multipHed by the sheer error 42A to produce a post-cursor phase error 2121. 

The post-cursor phase enx)r 2121, whra accumulated over time, represents tiic 
correlation between a present sheer error and a past tentative decision, flius indicates the 
sampUng phase mor with respect to the level-crossmg pomt m ttie tail ead of the signal 
pulse. In one embodiment, this level-crossing pomt is deteimmed by ttie first tap 
coefficient of ttie DFE 312 of HG. 3. At tiie zenM^rossing pomt at flie start of ttie signal 
pulse, ttie slope of the signal pulse is positive, while at ttie level-crossmg pomt at tfie tail 
end of ttie signal pulse, the slope of the signal pulse is negative. Thus, the pre-cursor 
phase error 21 15 and the post-ausor phase error 2121 must be combmed witti opposite 
signs m ttie adder 2122. The combmation of ttie pre-cursor 21 15 and post-cuiror phase 
errors 2121 produces the phase error associated vrith one of the sampling clock signals 
AGLKO - ACLK3. Hiis is the phase otot indicated as one of the phase errors 0 through 3 
in FIG. 20. 
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The phase oflQset 2 1 02 is one of the sampling clock ofl&et signals ACLKO Offeet 
through ACLK3 OflBjet in FIG. 20. The phase offset 2102, when needed, is generated by 
the PHY Control system of the gigabit transceiver. The phase ofiset 2102 is delayed by 
one unit of time then is added to the combination of the pre-cureor error 21 15 and post- 
cursor 2121 via the adder 2122 to produce an adjusted phase error. The adjusted phase 
error 2123 is stored in the delay element 2124 and ou^utted to the first filter 2130 at the 
next clock transition. The delay element 2124 is used to prevent the propagation delay of 
the adder 2122 fix>m concatenating witti the propagation delay of tiie adder 2 1 32 in the 
first filter 2130. 

The first filter 2130, termed "phase accumulator", accumulates the phase raror 
2125 outputted by the phase detector 610 over a period of time then outputs the 
accumulated result at the end of the period of time. In the exemplary embodiment shown 
in HG. 21, this period of time is 16 symbol periods. The first filter 2130 is an 
"accumulate-and-dump" filter which includes the adder 2132. a delay element (i.e., 
register) 2134, and a 16-units-of-time register 2136. The register 2126 outputs a lowpass 
filtered phase error 2127 at the rate of one per period of flie TRSAMPO 2104 clock, that is. 
one every 16 symbol periods. When the register 2126 outputs the lowpass filtered phase 
error 2127. the register 2134 is cleared and the accumulation of phase error 2125 restarts. 
It is noted that, downstream from the register 2126. circuits are clocked at one sixteenth of 
the symbol rate. 

The filtered phase enor 2137 is inputted to a multipher 2140 where it is multiplied 
by a fector diflferent than 1 when it is desired that the bandwidth of the phase locked loop 
be different than its normal value (which is detemimed by the design of the filter), hi the 
exemplary embodiment depicted in HG. 21, filtered phase error 2137 is multipUed by the 
value 2 outputted fiom a multiplexer 2142 when the select signal 2106 indicates that the 
loop filter bandwidth must be larger than normal value. This occurs, for example, during 
startup of the gigabit transceiver. Similarly, although not shown m FIG. 21, when it is 
desired that the loop filter bandwidth.be narrower than normal value, the filtraed phase 
enor 2137 can be multiplied by a value less than 1. 

The output 2144 of the multipUer 2140 is inputted to the second filter 2150 which 
is an integrator and to the adder 2160. The integrator 2150 is an Iffi. filter having an adder 
2152 and a register 2154, operating at one sixteenth of the symbol rate. The integrator 
2150 integrates the signal 2144 (which is essentiaUy the filtered phase error 2137) to 
produce an integrated phase error 2156. The purpose of the phase locked loop is to 
generate a resulting phase for a sampling clock signal such that the phase error is equal to 
zero. The purpose of the integrator 2150 in the phase locked loop is to keep the phase 
eror of the resulting phase equal to zero even when fliae is static frequent^ error. 
Without the integrator 2150, the static frequency error would result in a static phase error 
which would be attenuated but not made exactty zero by the phase locked loop. With the 
infa^rator 2150 in the phase lodced loop, any static phase error would be integrated to 
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produce a large growing iiq>ut signal to the NCO 670, which would cause the phase locked 
loop to correct the static phase error. The integrated phase error 2156 is scaled by a scale 
fector via a multiplier 2158, This scale factor contributes to the detennination of the gain 
of the integrator 2150. The scaled result 2159 is added to flie signal 2144 via an adder 
2160. 

The output 2162 of the adder 2160 is inputted to the NCO 2170. The output 2162 
is scaled by a scale factor, e,g., 2"* , via a multiplier 2172. The resulting scaled signal is 
recursively filtered by an HR filter formed by an adder 2174 and a register 2176. The HR 
filta: operates at one sixteenth of the symbol rate. The signal 2178, outputted every 16 
symbol periods, is used as the phase control signal to one of the phase selectors 2010, 
2020, 2030, 2040, 2050, 2060 (Fia 20). 

For the embodiment shown in FIG. 21, the gain parameters discussed above are as 
follows. , the gain of the NCO, is 2 " for normal bandwidth mode, 2 *** for high 
bandwidth mode. , the gain of the integrator 2 1 50, is equal to the product of the 
scaling of the mtegrator register 2154 (2** m FIG. 21) and the ratio of the phase locked 
loop sampling rate to the symbol rate (2^ in FIG. 21). For the word lengths and scaling 
indicated m FIG, 21, AT, is equal to 2 ". The gain A:^ of the phase detector 21 10 is 
computed by simulations and is equal to 2.2. These parameters are used to compute the 
theoretical transfer fimction of the phase locked loop (PLL) which is then compared with 
the PLL transfer fimction obtained by simuIatioiL The match is near perfect, confirming 
the validity of the design parameters. 

One ^bodiment of the system 2100 of FIG. 21 fiirtiier includes the external 
control signals PLLFR2; PLLPVAL, PLLPRST, PLLFVAL, PLLFRST, PLLPRAMP, 
which are not shown e)q)Licitly in FIG. 21 . 

The control signal PLLFRZ, when applied, forces the phase error to zero to point 1 
of the first filter 2130, therefore causes freezmg of updates of the frequency change and/or 
phase diange, except for any phase change caused by a non-zero value in the frequency 
register 2154 of the inte^or 2150. 

The control signal PLLPVAL is a 3-bit signal provided by the PHY Control 
systOTL It is used to specify the reset value of the NCO register 2176 of the NCO 2170, 
and is used in conjunction with the control signal PLLPRST. 

Hie control signal PLLPRST, when applied to the NCO register 2176 in 
conjunction with the signal PLLPVAL, resets the 6 most significant bits of the NCO 
register 2176 to a value specified by 8 times PLLPVAL. The reset is performed by 
stqjping up or down the 6 MSB field of the NCO register 2176 such that the specified 
value is readied after a minimum number of steps. Details of ttie phase reset logic block 
used to reset the yalue of the register 2176 of the NCO 2170 arc shown in FIG. 22 and will 
be discussed later. 

PLLFVAL is a 3-bit signal provided by the PHY Control system. It is to be 
intopreted as a 3-bit two's conq)lement signed integer in the range [-4,3]. It is used to 
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specify the reset value of the frequency register 2154 of the integrator 2150 and is used in 
conjunction with the control signal PLLFRST. 

The control signal PLLFRST, whea applied to flie frequency register 2 154 of the 
integrator 2150 in conjunction with the signal PLLFVAL, resets the ficquency register 
2154 to the value 65536 times PLLFVAL. 

The control signal PLLPRAMP loads the fixed number -2048 into the frequency 
register 2154 of the integrator 2150. This causes the phase of a sampling clock signal (and 
receive clock RCLK) to ramp at the fixed rate of -2 ppm. This is used during startup at 
the master constituent transceiver. PLLPRAMP overrides PLLFRST. In other words, if 
both PLLPRAMP and PLLFRST arc both applied, the value loaded into the frequency 
register 2154 is -2048, regardless of the value that PLLFRST tries to load. 

FIG. 22 is a bloc^ diagram illustrating the phase reset logic block 2200 to the NCO 
2170. The control signal PLLPRSTisiq)pKed to the AND gate 2202. The output of the 
AND gate 2202 is applied to die incrranent^decrement enable input of the register 2176. 
The 3-bit value PLLPVAL from the PHY Control System of the gigabit transceiver is 
shifted left by 3 bits to form a 6-bit value 2204. 

The current output of the register 2176 of the NCO 2170 (FIG. 21), which is the 
phase control signal inputted to the correspondmg phase selector (FIG. 20), is subtracted 
from this shifted value of PLLPVAL via an adder 2206. Module 2208 detemiines whether 
the ou^ut of adder 2206 is non-zero. If it is non-zero, then module 2208 outputs a "1" to 
the AND gate 2202 to enable the enable input of register 2176. If it is zero, module 2206 
outputs a zero to the AND gate 2208 to disable the enable input of the register 2176. 
Module 2210 detemiines whether the output of adder 2206 is positive or negative. If it is 

positive,module2210outputsacounttq)indicatortotheregister2176. Ifitis negative, 
module 2210 outputs a count down indicator to registo^ 2176. 

The subtraction at adder 2206 finds the shortest path from the current value of flie 
NCO regwter 2176 to the shifted PPLVAL 2204. For example, suppose the current phase 
value of register 2176 is 20. If the shifted PPLVAL 2204 (which is the desired value) is 
32, the diflFerence is 12, which is positive, therefore, the register 676 is incremented. If flie 
desired phase value is 56, the diflFerence is 36 or "lOOlOO" which is interpreted as -28, so 
the register 2176 will be decremented 28 consecutive times. The phase steps occur at the 
rate of one every 16 symbol periods. This single stepping is needed because of the way 
flie phase selector operates. The phase selector can only increment or decrement from its 
current setting. 

no. 23 is a block diagram of an exemphxy phase shifter logic block used for the 
phase confrol ofthe receive clock signal RCLK. The phase shifter logic block 2300 is 
needed when the signal RCLK Ofl&et (FIG. 20) is used to adjust the phase of the receive 
clock signal RCLK. The signal RCLK 0£&et is a 6-bit signal provided by the PHY 
Control system, and specifies the amount by vMch the phase of RCLK must shifted. 
Even if the signal RCLK Oflfeet indicates a large amount of phase shift, this phase shift 
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must be transferred to the iiqjut of the phase selector 2050 (FIG- 20) one step at a time due 
to the way the phase selector operates. The change of phase of RCLK must occur in the 
direction mdicated by a control signal STEPDIR genraated by the PHY Control system. 

The phase shifta- logic block 2300 includes a comparator 2302, an oflfeet register 
2304 and the adder 2042 (the same adder indicated in FIG. 20). The comparator 2302 
compares flie output 2306 of the oflfeet register 804 witti the signal RCLK Oflfeet. If the 
two signals are equal, then the comparator 2302 outputs a "0" to the enable iiqjut of the 
oflfeet siegister 2304 to disable ttie up/down counting of the oflfeet register 2304, thus 
keeping the output 2306 the same for the next time period. If the two signals are not 
equal, the comparator 2302 outputs a "r to the enable input of the oflfeet rcgisto- 2304 to 
enable the up/down counting, causing the output 2306 to be incremmted or decremented 
at the next time period. The signal STEPDIR fiom the PHY Control system is inputted to 
the up/down input of the oflfeet register 2304 to control the counting direction. The output 
2306 fiom the oflfeet register 2304 is added to the phase control signal 2009 produced by 
the NCO 2008 (FIG. 20) via the adder 2042 to generate the phase control signal 2049 
(FIGS. 23 and 20) for the RCLK phase selector 2050 (FIG. 20). 

The coupling of switching noise fiom the digital signal processor that implements 
the transceiver functions to each of the A/D converters is an important problem that needs 
to be addressed. Switching noise occurs whra transistors switch states in accoidance vrith 
transitions in the clock signal (or signals) that controls flieir operation. Switching noise in 
the digital section of the transceiver can be coupled to the analog section of the 
transceiver. Switching noise can cause severe degradation to the perfonnance of an A/D 
converter if it occurs right at or near the instant the A/D converter is sampling the received 
signal. The present invention, in addition to providing a timing recovery method and 
system, also provides a me&od and system for minimizing the degradation of the 
performance of the A/D converters caused by switching noise. 

The eflfect of switching noise on an A/D converter can be reduced if the switching 
noise is synchronous (with a phase delay) with the sampling clock of the A/D converter. 

in addition* it is possible to adjust the phase of the sampling clock of the A/D converter 
with req^t to the phase of the switdiing noise, then the phase of the sampling clock of 
the A/D convCTter can be optimized for minimum noise. It is noted that, for a local gigabit 
transceiver, the sanq)ling clock signals ACLKO, ACLKl, ACLK2, ACLK3 are 
syndironous to each other (i.e-, having the same fiequency) because they are synchronous 
to the 4 transmitters of the remote transceiver and these 4 remote transmitters are clocked 
by a same transmit clock signal TCLSL It is also in^Yortant to note that the local rwdve 
clock signal RCLK is synchronous to the local samplmg clock signals ACLKO, ACLKl , 
ACLK2,ACLK3. 

Refiaring to FIGS. 2 and 5, flie four A/D converters 216 of ttie four constituent 
transceivers are sampled with the sanqiling clodc signals ACLKO. ACLKl, ACIK2, 
ACLK3, Each of the phases of these sampling cloc^ signals is detomined by the 
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subsystem 600 (FIG. 21) of the timing recovery system 222 in response to the phase of the 
corresponding received signal, which dq>ends on the remote transmitter and the hne 
charactmstics. Thus, the phases of the sampling clock signals change fix)m Une to line, 
and are not under the control of the system designer. 

However, the relative phase of the receive clock signal RCLK with respect to the 
samphng clock signals AO-KO, ACLKl, ACLK2, ACLK3 can be controlled by adjusting 
the signal RCLK Ofl&et (FIG. 20). The signal RCLK Ofl&et can be used to select the 
RCLK phase that would cause the least noise coupUng to the A/D converter 216 of FIG. 
2. The underlying principle is the following. Referring to FIG. 2 and the boundaries of 
the clock domain, the aitire digital signal prrcessing, control and interface fimctions of the 
receiver operate in accordance with transitions in the receive clock signal RCLK In other 
words, most of the digital logic circuits switch states on a transition of RCLK (more 
specifically, on a rising edge of RCLK). Only a small portion of the transceiver operates 
in accordance with transitions in the transmit clock signal TCLK. Therefore, most of the 
switching noise is synchronous with the receive clock signal RCLK Since the receive 
clo(*: signal RCLK is synchronous with the san^ling clock signals ACLKO, ACLKl , 
ACLK2, ACLK3, it follows that most of the switching noise is synchronous with ttie 
sampling clock signals ACLKO, ACLKl, ACLK2, ACLK3. Therefore, if the phase of the 
receive clock signal RCLK is adjusted such that a transition in the signal RCLK occurs as 
fer as possible in time &om each of the sampling clock signals AdJCO, ACLKl, ACLK2, 
ACLK3, then the switching noise coupling to the A/D converters will be minimized. 

The process for adjusting the phase of the receive clock signal RCLK can be 
summarized as follows. The process performs an exhaustive search over all the RCLK 
phases that, by design, can possibly exist in one symbol period For each phase, the 
process computes flie sum of the mean squared errors (MSEs) of the 4 pairs (i.e., the 4 
constitumt transceivCTs). At the end of the search, the process selects the RCLK phase 
that minimizes the sum of the MSEs of the four pairs. The following is a description of 
one embodiment of the RCLK phase adjustment process, where there are 64 possible 
RCLK phases. 

FIG. 24 is a flowchart illustratmg the process 2400 for adjusting the phase of the 
receive clock signal RCLK Upon Start (block 2402), process 2400 initializes all the state 
variables (which include counters, registers), sets Ofl&et to -32 (block 2404), sets 
Min^MSE equal to the MSE of the gigabit transceiver before any RCLK phase change, 
and sets BestOfe^ equal to zero. The MSE of the gigabit transceiver is the sum of the 
mean squared errors (MSEs) of the 4 constituent transceivers. The MSE of a constituent 
transceiv@- is tiie mean squared enx)r of the corresponding ID component of the 4D sheer 
esTor 42 (FIG. 2), and is outputted by a MSE computation block 2700 (FIG. 27) for every 
fiame. Each fiame is equal to 1024 symbol periods. Ihis initialization is done within a 
duration of 1 fiame. Process 2400 then waits for the effect of the RCLK phase change on 
the system to settle (block 2406). The duration of this waiting is 5 fiames. Process 2400 
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then confutes MSB (by smnming the MSEs of all four constituent transceivers outputted 
by the corresponding MSB computation block 2700 of FIG. 27) which corresponds to the 
currmt setting of RCLK Ofifeet (block 908). The duration of block 2408 is one frame. In 
block 2410, process 2400 compares the new MSB with Min_MSE. If the new MSB is 
strictly less than Min_MSE, then Min-MSE is set to the value of the new MSB and 
BestOfife^ is set to the value of O^t. In block 2412, process checks whether Of&et is 
equal to 31, i.e., whether all possible 64 phase o£&ets have been searched. If Of&et is not 
equal to 31, then process 2400 increments Ofifeet by 1 (block 2414) then continues the 
search for the best RCLK Offeet by going back to block 2406. If OfEset is equal to 3 1, that 
is, if process 2400 has searched all possible 64 phase ofifeets, then process 2400 sets Of&et 
equal to the value of BestOf&et (block 2416) then terminates (block 2418). The duration 
of each of blocks 2414 and 2416 is 1 frame. 

After adjustment of the receive clock RCLK phase, small adjustments can be made 
to the phases of the sampling clocks ACLKl, ACLK2, ACLK3 to further reduce the 
coupling of switching noise to the A/D converters. Since the timing recovery system 222 
of FIG. 20 without the ACLKO - 3 Ofl&ets, througji the phase locked loop principle, 
already sets the satnpling clocks at the optimal sampling positions with respect to the pulse 
sh^ of incoming signals from the rmiote transceivers, the small phase adjustments made 
to the sampling clocks could cause some loss of perfonnance of the A/D converters. 
However, the net result is still bettCT than performing no phase adjustment of the sanq^ling 
clocks and allowing the A/D converters to sample the incoming signals at a noisy instant 
where the transistors in the digital section are switching states. In the embodiment 
depicted in FIG. 20, phase adjustment is not made to the sampling clock ACLKO because, 
by design of the structure of the embodiment, the phase difference betweai ACLKO and 
RCLK is equal to RCLK OflEset. Thus, in this embodiment, any adjustment to ttie phase of 
ACLKO will also move RCLK away from the optimal position determined by process 
2400 above by the same amount of phase adjustment. 

FIGS. 25 A, 25B, 25C illustrate three examples of distribution of the transitions of 
clock signals within a symbol period to fiulher clarify the concq}t of phase adjustment of 
the clock signals. It is noted that, in these examples, the four sampling clock signals 
ACLKO - 3 are shown as occurring in their consecutive order within a symbol pmod for 
illustrative purpose only. It is undoistood that the sampling clock signals ACLKO - 3 can 
occur in any order. 

HG. 2SA is a first example of clock distribution where the transitions of the four 
sampling clock signals ACLKO - 3 are evmly distributed within the symbol period of 8 
nanoseconds (ns). Thus, each ACLK clock transition is 2 ns apart from an adjacent 
transition of another ACLK clock. Thwefore, for this clock distribution example, a 
transition of the receive clock RCLK can only be placed at most 1 ns away from an 
a^acCTt ACLK transition. This "distance" ^hase delay) may not be enough to reduce the 
cotq)ling of switdiing noise to the two A/D converters associated with the two adjacent 
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sampling clock signals (ACLO and ACXKO, in the example). In this case, it may be 
desirable to sUghUy adjust the phase of the two adjacent sampling clock signals to move 
their respective transitions further away fiom a RCLK transition, as iUustrated by their 
new transition occurrences withm a symbol period in FIG. 25A. 

FIG. 25B is a second example of clock distribution where the transitions of the 
four sampling clock signals ACLKO - 3 are distributed within the symbol period of 8 
nanoseconds (ns) such that each ACLK clock transition is 1 ns apart fitim an adjacent 
transition of another ACLK clock. For this clock distribution example, a transition of the 
receive clock RCLK can be positioned midway between the last ACLK transition of one 
symbol period (ACLO in ¥IG. 25B) and flie first ACLK transition of the next symbol 
period (ACLKO in FIG. 25B) so that the RCLK transition is 2.5 ns fiom an adjacent 
ACLK transition. This "distance" ^hase delay) may be enough to reduce the couphng of 
switching noise to the two A/D converters associated with the two adjacent sampling clock 
signals (ACLK3 and ACLKO, in the example). In this case, phase adjustinent of the two 
adjacent sampling clock signals to move their respective transitions further away fiom a 
RCLK transition may not be needed. 

FIG. 25C is a third example of clock distribution where the transitions of the four 
sampling clock signals ACLKO - 3 occur at the same instant within the symbol period of 8 
nanoseconds (ns). In this clodc distribution example, a transition of tiie receive clock 
RCLK can be positioned at the maximum possible distance of 4 ns fiom an adjacent 
ACLK transition. This is the best clock distribution tiiat allows maximum reduction of 
coupling of switching noise to the four A/D converters associated with the sampUng clock 
signals. In this case, there is no need for phase adjustment of the sampling clock signals. 

For the embodiment shown in FIG. 20 of the timing recovery system 222 (FIG. 2), 
tfie following phase adjustinent proems is ^plied to the three sanqiling clock signals 
ACLKl, ACLK2, ACLK3. It is understood that, in a diflf«ent embodiment of the timing 
recovery system 222 (FIG. 2) where the receive clock signal RCLK is not tied to one of 
the sampling clock signals ACLKO - 3, the following phase adjustment process can be 
^plied to all of the sampling clock signals. 

The process for adjusting flie phase of a sampling clock signal ACLBCx ("x" in 
ACLKx denotes one of 0, 1,23) can be summarized as follows. The process performs a 
search over a smaU range of phases around the initial ACLKx phase. For each phase, the 
process logs the mean squared raror MSE of tiie associated constitiient transceivers. At 
flie end of the seardi, the process selects the ACLKx phase that minimizes flie MSE of the 
associated constituent transceivCT. 

Whenevo- the phase of a Banq)ling clock signal ACLKx changes, die coefficients 
of the echo canceller 232 and of the NEXT canceUers 230 change. Thus, to avoid 
degradation of performance the phase steps of the samphng clocks should be small so that 
die change they induce on the coefficients is also smalL When tiie phase adjustinent 
requires multq>le consecutive phase steps, the convergence of the coefficients of the echo 
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canceUer 232 and of the NEXT canceUers 230 should be fest in order to avoid a buildup of 
coefGdent mismatch. 

FIG. 26 is a flowchart illustrating an ranbodimoit of the process for adjusting the 
phase of a sampling clock signal ACLKjc associated with one of the constituent 
tiansceivas, where the search is ovct a range of 1 6 phases around the initial ACLKx 
phase. For each of flie constituent transceiveis, process 2600 of FIG. 26 is run 
indqiCTdenfly of and concurrently wifli the other constituent transceivers. Upon Start 
(block 2602), process 2600 initializes aU the state variables (which include counters, 
registers), sets Ofl&et to -8 (block 2604), sets Min_MSE equal to the MSE of the 
associated constituent transceiver before any RCLK phase change, and sets BestOflfeet 
equal to zero. The MSE of the associated constituent transcdver is the mean squared error 
of the corresponding ID component of the 4D sheer error 42 (FIG. 2). This initialization 
is done within a duration of 1 frame. Process 2600 then waits for the effect of the ACLK 
phase change on the system to settle (block 2606). The duration of this waiting is 32 
frames, (block 2608). The duration of block 2608 is one frame, hi block 2610, process 
2600 compares the new MSE (outputted by the corresponding MSE computation block 
2700 of FIG. 27) which corresponds to flie current setting of ACLKx Ofi&et witii 
Min_MSE. If the new MSE is strictly less than Min_MSE, flien Min-MSE is set to the 
value of the new MSE and BestOf&et is set to the value of OfeeL hi block 2612, process 
2600 checks whether Offeet is equal to 7, i.c., whether all 16 phase effects in the range 
have been searched. If Oflfeet is not equal to 7, tfien process 2700 increments Ofl&et by I 
(block 2614) them continues the search for flie best ACLKx Ofi&et by looping back to 
block 2606. If Offeet is equal to 7, that is, if process 2600 has searched aU the 16 phase 
ofifeets in ttie range, then process 2600 sets Offeet equal to the value of BestOffset (block 
2616) then tennmates (block 2618). The duration of each of blocks 2614 and 2616 is 1 
frame. 

FIG. 27 is a block diagram of an exaiqilary implementation of the MSE 
computation block used for computing flie mean squared error of a constituent transceiver. 
In one embodiment of the gigabit transceiver, fliere are four MSE computation blocks, one 
for each offlie four constituent transcdvers. The four MSE computation blocks are run 
indq)endaitly and concurrcnfly for flie four constituent transcdvers. The MSE 
computation block 2700 includes a squaring module 2702 and an infinite impuke response 
(BR) filter 2704. The IIR filter 2704 includes an adder 2706, a feedback delay dement 
2708 and a forward deky dement 271 0. The squaring module 2702 recdves flie 
corresponding ID component of flie 4D slicer error 42 (FIG. 2), which is denoted as 42A 
for sinqilicity, and out puts tiie squared enror value to ttie filter 2704. The filter 2704 
accumulates flie squared error values by adding via tiie adder 2706 tiie current squared 
aror value to flie previous squared eaor value stored in flie feedback delay elemait 2708. 
The accumulated vahie is stored in tiie forward registw 2710. hi flie exenq)laiy 
mbodiment shovm in FIG. 27, flie squared error values are accumulated for 1024 symbol 
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periods (which is one frame of the PHY Control system). Since the accumulation period 
is sufficiently long, the accumulated value practicaUy corresponds to the mean squared 
error. At the end of the accumulation period, the clock signal 2720 fiom the PHY Control 
system clears the contents of the feedback delay element, and clocks the foiward delay 
element 2710 so that the forward delay element 2710 outputs the accumulated value MSB 
and resets to zero. 

While certain exenq)lary embodiments have been described m detaU and shown in 
the accompanying drawings, it is to be understood tiiat such embodunents are merely 
iUustrative of and not restrictive on the broad invention. It wiU flius be recognized tiiat 
various modifications may be made to the illustrated and other embodunents of die 
invention described above, without departing fitnn the broad inventive scope diereof. It 
will be understood, tfaeiefore, that flie invention is not limited to ttie particular 
embodiments or arrangements disclosed, but is rather intended to covct any cdianges, 
adaptations or modifications which are withm the scope and spirit of the invention as 
defined by the appended claims. 
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What is claimed is: 



1 . An integrated circuit communication device configured for operation over a 
multi-pair transmission channel, the communication device conqirising: 

measurement ciicuitiy configured to measure a performance degradation 
characteristic resulting from disabling each member of a set of sub-pluralities of a plurality 
of circuit elemoits; 

disabling circuitry configured to adaptively disable one or more of the sub- 
phiraHties of the circuit elements until the performance degradation characteristic reaches 
a threshold level; and 

a decision feedback sequence estimation (DFSE) circuit, the DFSE 
decoding an input sample into a final decision corresponding to a codewoni of a treUis 
code having N states, flie DFSE including; 

a decoder circuit for decoding a set of signal samples to generate 
tentative decisions and the firuil decision; and 

a single state decision feedback equalizer. 

2. The integrated circuit communication device according to claim 1 , the 
dedsion feedback equalizer coupled to the decoder circuit for receiving the tentative 
decisions, the single state decision feedback equaUzer including: 

a set of low-ordered coefficients; and 

a sei of high-ordered coefficients generating a tail value based on the 
toitative decisions and the input sample. 

3. The integrated circuit communication device according to claim 2. fiirther 
oomprismg a state multipUcation circuit, the state multipKcation circuit ejqjanding a single 
state rqiresentation of a signal received 6om the smgle state decision feedback equaUzer 
into an N state rqiresentation suitable for decoding by the DFSE. 

4. The nitegrated circuit communication device according to claim 3, the state 
multipHcation circuit comprising a multiple decision feedback equaUzer coupled to the 
decision-feedback equaUzer and generating an N state representation of signal samples in 
jespoosG to the taU value and tiie set of low-onlered coefficients received fiom the 
decision feedback equalizer. 

5. Hie int^rated circuit communication device according to claim 1 , the 
DFSE circuit fiirtfaer comprising: 
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a Viterbi decoder for receiving the set of signal san^les, the Viterbi 
decode computing path metrics for each of the N states of the trellis code and outputing 
decisions based on the path metrics; and 

a path manory module coupled to the Viterbi decoder for receiving the 
decisions, the path memory module having a number of dqsth levels corresponding to 
consecutive time instants, each of the depth levels including N registers for storing 
decisions corresponding to the N states, each of selected depth levels including a 
multiplexer for selecting a best decision from corresponding N registers, the best decision 
at the last depth level being the final decision, the best decisions at other selected depih 
levels being the tentative decisions. 

6. The integrated circuit cdnmiunication device according to claim 4, the 
midtiple decision feedback equalizer coti^rising: 

a memory; 

a set of symbolic levels contained within the memor)^ and 
a convolution engine coupled to combine the set of low order coefficimts 
with each member of flie set of symbolic levels. 

7. An integrated circuit communication device configured for operation over a 
multi-pair transmission diannel, the conununication device comprising: 

measurement circuitry configured to measure a performance degradation 
charactmstic resulting &om disabling each member of a set of sub-pluralities of a plurality 
of circuit elements; 

disabling circuitry configured to adaptively disable one or more of the sub- 
pluralities of the circuit elements until the performance degradation characteristic reaches 
a threshold level; and a single state decision feedback equalizer, 

8. Hie integrated circuit communication device according to claim 7, the 
single state decision feedback equalizer having a set of ordCTcd coefficients, the decision 
feedback equalizo: defining a coefficient related tail value and a low order subset of 
coefficifflt values 

9. The integrated circuit conununication device according to claim 8, wherein 
the single state decision feedback equalizer has a width dimension D, wherein the width 
dimension D corresponds to the number of pairs defining the multi-pair transmission 
channel. 

10. The integrated circuit communication device according to claim 9, fiirtho: 
conqirising a state miiltq)lication circuit, the state multiplication circuit sanding a single 
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State representation output signal received fiom the single state decision feedback 
equalizer into an N state representation signal suitable for decoding by the DFSE. 

1 1 . The integrated circuit conununication device acconling to claim 1 0, the 
state multiplication circuit comprising: 

a convolution engine coupled to combine flie low order subset of 
coefficient values with each membo- of a set of symboKc levels to define a fiist sample 
signal set; and 

a summmg circuit coupled to combine the tail value with each member of 
Ihe first sample signal set to define an N state representational set of signal samples. 

12. The integrated circuit communication device according to claim 7, fiirther 
comprising: 

a control module controlling activation and deactivation of at least a 
portion of the sub-pluraKtics of the circuit elements according to a criterion, the criterion 
being based on at least one of an information error metric, a power metric, a specified 
error and a specified powei^ and 

a computing module coupled to the control module, the computing module 
con:q>uting at least one of ttie information enor metric and tfie power metric. 

13. The integrated circuit communication device according to claim 10, 
wherein the critmon is the following: 

activate if the mformation error metric is greater than the specified error; 

and 

deactivate if the information error metric is smaller than the specified error. 

14. The integrated circuit communication device according to claim 13, 
wherein the aiterion is the following: 

activate if the infi>rmation error metric is greater than the specified error 
and the powCT metric is smaller than flie specified power, and 

deactivate if the information cnor metric is smaller than the specified error 
or the powor metric is greats than the specified power. 

1 5. Tbc integrated cfapuit communication device according to claim 14, 
wherein the infi)rmation error metric is related to a bit error rate of flie communication 
sy^em. 

16. An integrated circuit communication device configured for operation over a 
multi-pair transmission channel, the communication device con9)rismg: 
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a single state decision feedback equalizer having a set of ordered 
coefficiCTts, the decision feedback equalizer defining a coefficient related tail value and a 
low order subset of coefiScient values; 

a state multiplication circuit, the state multq)Ucation circuit expanding a 
single state reprcsentafion output signal received fiom the single state decision feedback 
equalizer into an N state representation signal suitable for decoding by the DFSE; 

a first ISI conq)ensation circuit receiving an input signal and ouQ^utting a 
second signal substantially compensated for a first ISI component; and 

a second ISI compensation circuit, the second ISI compensation circuit 
receiving the second signal and groerating a tiuid signal, the third signal being 
substantially compensated for a second ISI component 

17. The integrated circuit communication device according to claim 16, the 
first ISI compensation device comprising an equalizer circuit, including: 

an ISI conq>ensation filter having a substantially inverse impulse response 
to the impulse response of a pulse sluq>ing filter of a remote transmitter; and 
an ad^tive gain stage. 

18. The integrated circuit communication device according to claim 16, the 
second ISI compensation device conqsrising a decision feedback sequence estimation 
cucuit 

19. The integrated circuit communication device according to claim 18, the 
decision feedback sequence estimation circuit comprising: 

a decoder circuit receiving and decoding at least one ISI compensated 
signal sanq)le, and generating tentative decisions and a final decision; and 

a decision feedback equalizer coupled in feedback fashion to the decoder 
block, the decision feedback equalize* including a set of lowrordered coefficioits and a set 
of higji-ordered coefiBcients, the decision feedback equalizer generating a first portion of 
ISI compoisation for the second ISI component based on the tentative decisions and the 
high-ordered coefiScimts. 

20. The integrated circuit communication device according to claim 19, 
wherein the decision feedback sequence estimation circuit fiirther comprises a convolution 
mgine coupled to the decision feedback equalizer to receive values of the low-K>rdeml 
coeffidrats, the convolution engine con:q)uting a set of pre-computed values r^resenting 
a set of potential second ISI compensation portions for the second ISI componraiL ^ 
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2 1 . The integrated circuit communication device according to claim 20, 
wherein a second digital signal is combined with the first portion of ISI compensation to 
produce a third digital signal partially compensated for the second ISI component. 

22. The integrated circuit communication device according to claim 21 , 
wherein the decision feedback sequence estimation circit fiirther comprises a multiple 
decision feedback equalizer coupled to the decision feedback equalizer and the 
convolution engine, the multiple decision feedback equalizer combining the set of pre- 
computed values with the third digital signal to produce a set of potential digital signals, 
one of thcpotential digital signals being substantially compensated for the second ISI 
component 

23. The integrated circuit commimication device according to claim 22, 
wherein flie first ISI component r^resents ISI introduced by a remote transmission device, 
and wherein the second ISI component represents ISI introduced by transmission charmel 
characteristics. 

24. An integrated circtiit communication device configured for operation over a 
multi*pair transmission chatmel, the communication device comprising: 

measurement circuitry configured to measure a performance degradation 
characteristic resulting fix)m disabling each member of a set of sub-plinalities of a pluraUty 
of circuit el^fients; 

disabling circuitry configured to adaptively disable one or more of the sub- 
pluralities of the circuit elements until the performance degradation characteristic reaches 
a threshold level; and a first ISI compensation circuit configured to condensate for 

a transmitto* induced ISI componmt; and 

a second ISI compensation circuit configured to compensate for a 
transmission charmel induced ISI component 

25 . The integrated circuit conimunication device according to claim 24, the 
first ISI conq>msation device corr^irising an equalize circuit, including: 

an ISI compoisation Gltcc having a substantially inv^^ impulse response 
to flie impulse response of a pulse shying filter of a remote transmitter; and 
an adaptive gain st^e. 

26- The integrated circuit communication device according to claim 24, the 
second ISI compensation device comprising a decision feedbadc sequoice estimation 
circuit 
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27. The integrated circuit commumcatioii device according to claim 26, the 
decision feedback sequence estimation circuit comprising: 

a decoder circuit receiving and decoding at least one ISI compensated 
signal sample, and generating tentative decisions and a final decision; and 

a decision feedback equalizer coupled in feedback &shion to the decoder 
block, the decision feedback equalizer including a set of low-ordered coefficients and a set 
of high-ordered coefficients, the decision feedback equalizer generating a first portion of 
ISI compensation for the second ISI component based on the tentative decisions and the 
high-ordered coefficients. 

28. The integrated circuit communication device according to claim 27, 
M(1ierein the decision feedback sequence estimation circuit fiirther comprises a convolution 
engine coitpled to the decision feedback equalize to receive values of the low-ordered 
coefficients, the convolution engine computing a set of pre-computed values representing 
a set of potential second ISI conq)ensation portions for flie second ISI conq)onent. 

29. The integrated circuit conmiunication device according to claim 28, 
wherem a second digital signal is combined with the first portion of ISI compensation to 
produce a third digital signal partiaUy coinpensated for the second ISI cornponent 

30. The integrated circuit communication device according to claim 29, 
wherein the decision feedback sequence estimation circit fiuther comprises a multiple 
decision feedback equalizer coupled to the decision feedback equalizer and the 
convolution engine, the miiltiple decision feedback equalizer combining the set of pre- 
computed values with the third digital signal to produce a set of potential digital signals, 
one of the potential digital signals bemg substantially compensated for the second ISI 
component 

3 1 . The integrated circuit communication device according to claim 30, 
wherein the first ISI component represents ISI introduced by a remote transmission device, 
and wherein the second ISI component represents ISI introduced by transmission chaimel 
diaracteristics. 

32. The integrated circuit communication device according to claim 24, fiirther 
comprising: 

a control module controlling activation and deactivation of at least a 
portion of the sub-pluralities of the circuit elements according to a critOTon, the criterion 
being based on at Irast one of an information aror metric, a power metric, a specified 
oror and a specified power; and 
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a computing module coupled to the control module, the computing module 
computing at least one of the infomiation error metric and the power metric. 

33. The integrated circuit communication device according to claim 32, 
wherein the criterion is the following: 

activate if the information error metric is greater than the specified error, 

and 

deactivate if the information error metric is smaller than the specified error. 

34. The integrated circuit communication device according to claim 33, 
wherein the criterion is the following: 

activate if the information error metric is greater than the specified error 
and the power metric is smaller than the specified po wct; and 

deactivate if the mformation error metric is smaller than the specified error 
or the power metric is greater than the specified power. 

35. The integrated circuit communication device according to claim 34, 
wherein the information error metric is related to a bit error rBte of the communication 
system. 

36. An integrated circuit communication device configured for operation over a 
multi-pau: transmission channel, the communication device conq)rising: 

measuranent circuitry configured to measure a performance degradation 
characteristic resulting &om disabling each member of a set of sub-pluraUties of a pluraUty 
of circuit elanaits; 

disabling circuitry configured to adaptively disable one or more of the sub- 
pluralities of the circuit elements until the performance degradation characteristic reaches 
a threshold level; and 

a decoder Systran for computing the distance of a received symbolic word 
fit)m a codeword. 

37. The integrated circuit communication device according to claim 36, finther 
con^rising: 

a control module controlling activation and deactivation of at least a 
portion of the subi)luraUties of the circuit elranents accordmg to a criterion, the critmon 
being based on at least one of an information error metric, a power metric, a specified 
OTor and a specified pow^; and 

a conqjuting module coupled to the control module, the cotxxpyjting module 
conq)uting at least one of the information mor metric and the power metric. 
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38. The integrated circuit commimication device according to claim 37, 
wherein the critOTon is the following: 

activate if the information error metric is greater than the specified error; 



and 



deactivate if the information error metric is smaller than the specified error. 



39, The integrated circuit communication device according to claim 38, 
wherein the criterion is the following: 

activate if the information error metric is greater than the specified error 
and the power metric is smaller than the specified power, and 

deactivate if the information error metric is smaller than the specified error 
or the power metric is greater than the specified power, 

40. The integrated circuit communication device according to claim 39, 
wherein the information error metric is related to a bit error rate of the communication 
system. 

4L The integrated circuit communication device according to claim 36, 
configured to receive information encoded in accordance with a multi-level symbolic 
schraae and over a multi-dimensional transmission channel, flie decoder system 
comprising: 

an input, coiq>led to receive an input signal; 

a first slicer, coupled to detect the input signal with respect to a first one of 
two disjoint one^dim^isional symbol-subsets; and 

a second slicer. coupled to detect the input signal with r^ect to a second 
one of the two disjoint one-dimensional symbol-subsets; 

wherein the first slicer outputs a first decision term and a first error term 
with respect to the first one of the two diqomt one-dunensional symbol-subsets, the 
second slicer outputting a second decision term and a second error term witti respect to the 
second one of the two disjoint one-dimensional symbol-subsets; and 

wherein each of the first and second oror terms is expressed by a digital 
representation haviag substantially fewer bits flian the input signal. 

42. The symbol decoder according to claim 41, wherein each of flie first and 
second error terms rq)resmts a distance metric between the input signal and a symbol in 
the respective one of the two disjomt one-dimensional symbol-subsets. 

43. The iritegrated circuit communication device according to claim 36, 
configured to receive information encoded in accordance with a multi-level symbolic 
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scheme and over a multi-dimensional transmission channel, the decoder system 
comprising: 

an iiq)ut to receive an input signal; 

a first slicer coupled to the iiqjut, the first slicer detecting the input signal 
with respect to a first one of two disjoint one-dimensional symbol-subsets; 

a second sKcct coupled to the irq)ut, the second slicer detecting ttie input 
signal with respect to a second one of the two disjoint one-dimensional symbol-subsets; 
and 

a third slicer coupled to detect the iiqiut signal with respect to a union set of 
the two disjoint one-dimensional symbol-subsets. 

44. The integrated circuit communication device according to claim 43, 
wherein the first slicer outputs a first decision with respect to the first one of the two 
disjoint one-dimensional symbol-subsets, the second slicer ouputting a second decision 
with respect to the second one of the two disjoint one-dimensional symbol-subsets, and 
wherein the third slicer outputs a third decision with respect to the union set of the two 
disjoint one-dimensional symbol-subsets. 

45. The integrated circuit communication device according to claim 44, fiuther 
comprising: 

a first combination logic block configured to combine the first decision 
with the third decision, the first combination logic block defining a first error term; and 

a second combination logic block configured to combine the second 
decision with the third decision, the second combination logic block defining a second 
error term. 

46. The integrated circuit communication device according to claim 45, fiirther 
comprising: 

a first square error generation block configured to operate on the first error 
term so as to define a square mor represoitafion thereof; and 

a second square error generation block configured to operate on the second 
error torn so as to define a square error rq)resCTtation thereof 

47. The integrated circuit communication device according to claim 46, 
wherein each of the error terms is expressed as a digital rq)resmtation having one bit 

48. An integrated circuit commimication device configured for operation over a 
multi-pair transmission channel, the communication device comprising: 

a first ISI compensation circuit configured to compoisate for a transmitter 
induced ISI component; 
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a second ISI compensation circuit configured to compensate for a 
transmission channel induced ISI conqjonent; and 

a decoder syston for computing the distance of a received symbolic woid 
fit>m a codeword. 

49. The integrated circuit communication device according to claim 48, 
wherein the first ISI compensation circuit comprises: 

an inverse partial response filter having an impulse response substantially 
an inverse of an impulse response of a pulse shaping filter of a remote transmitter, so as to 
substantiaUy compensate an input digital signal for a fiist ISI component 

50. The integrated circuit communication device according to claim 49, 
wherein the inverse partial response filter is implemented with a characteristic feedback 
gain &ctor IC 



5 L The integrated circuit communication device according to claim 50, 
wherem the inverse partial response filter operates in accordance with a non-zera value of 
the characteristic feedback gain fector K during communication initialization and wherein 
the value of the feedback gain fiictor K is ramped down to zero after a pre-defined interval, 

52. The integrated circuit communication device according to claim 51, 
wherein the second ISI compensation circuit comprises: 

a Vitcrbi decoder configured to decode a digital signal and generate 
tentative decisions; and 

feedback equalizer circuitry coupled to the Viterbi decoder, the feedback 
equaUzer circuitry receiving the tentative decisions and combining the tentative decisions 
with a set of high-ordered coefficients to generate a first value. 

53. The integrated circuit communication device according to claim 52, 
whCTein the second ISI compensation circuit fiirther comprises: 

summing circuitiy combining the first value with a second digital signal, 
the summing circuitry outputting an intermediate signal; and 

a multiple decision feedback equalize receiving the intennediate signal and 
combining die intermediate signal witfi a set of pre-conqjuted values generated by 
combining values of a set of low-ordered coefficients with a set of values representing 
levels of a multi-level symbolic alphabet to produce a set of potential digital signals, one 
of die potential digital signals being substantially ISI compensated, die multiple decision 
feedback equalizer outputting said one of die potential digital signals to die Viterbi 
decoder. 
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54. The integrated circuit communication device according to claim 53, 
wherein the characteristic feedback gain fector K is ramped to zero after convergence of 
the decision feedback equaUzer. 

55. The integrated circuit communication device according to claim 48, the 
codeword being a concatenation of L symbols selected fiom two disjoint symbol-subsets 
X and Y, the codeword being included in one of a plurality of code-subsets, the received 
word being represented by L inputs, each of the L inputs uniquely corresponding to one of 
L dimensions, the decoder system con^rising: 

a set of slicers for producing a set of one-dimensional errors fiom the L 
inputs, each of the one^hmensional errors representing a distance metric between one of 
the L-inputs and a symbol in one of the two disjoint symbol-subsets; and 

a combining module for combining the one-dimensional errors to produce a 
set of L-dimensional errors such that each of the L-dimensional enor^j is a distance of the 
received word fiom a nearest codeword m one of the code-subsets. 

56. The integrated circuit communication device according to claim 55, 
wherein each of the one-dimensional errors is represented by substantiaUy fewer bits than 
each of the L inputs. 

57. Hie integrated circuit communication device acconling to clahn 55, 
wherein the slicers slice the L inputs with respect to each of the two diqoint symbol- 
subsets X and Y to produce a set of X-based errors, a set of Y-based errors and 
corre^onding sets of X-based and Y-based decisions, the sets of X-based and Y-based 
errors forming the set of one-dimaisional eirors, die sets of X-based and Y-based 
decisions forming the set of one-dimensional decisions, each of the X-based and Y-based 
decisions bemg a symbol in a correqjonding symbol-subset closest in distance to one of 
flie L inputs, each of the one-dimensional mors rqjresenting a distance metric between a 
corresponding one^imensional decision and one of the L iiqiuts. 

58. The integrated circuit communication device according to claim 55, 
Mliaein die set of slicers comprises: 

first slicas for slicing each of the L inputs with respect to each of the two 
disjoint symbol-subsets X and Y to produce a set of X-based decisions and a set of Y- 
based decisions, the sets of X-based and Y-based decisions formmg the set of one- 
dimensional decisions, each of the X-based and Y-based decisions being a symbol in a 
correqionding symbol-subset closest in distance to one of the L iiqjuts; 

. second slicers for slicing each of the L irqiuts with respect to a symbol-set 
comprising all symbols of die two disjoint symbol-subsets to produce a set of hard 
decisions; and 
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OTor-conqjuting modules for combining each of the sets of X-based and Y- 
based decisions with the set of hard decisions to produce the set of one^iimensional errors, 
each of the one-dimensional errors representing a distance metric between the 
corresponding one-dimensional decision and one of the L inputs. 

59. The integrated circuit communication device according to claim 55. 
wherein the combining module con^rises: 

a first set of adders for combining the one-dimensional errors to produce 
two-dimensional errors; 

a second set of adders for combining the two-dimensional errors to produce 
intermediate L-dimensional errors, the intermediate I^dimensional errors being arranged 
into pairs of errors such that the pairs of errors correspond one-to-one to the code-subsets; 
and 

a minimum-select module Eor detennining a minimum for each of the pairs 
of errors, the minima being the L-dimensional errors. 

60. An integrated circuit communic^on device configured for operation over a 
multi-pair transmission charmel, the communication device comprising: 

a decision fisedback sequence estimation (DFSE) circuit, for decoding an 
irq>ut sample into a final decision corresponding to a codeword of a trellis code having N 
states, the DFSE including a single state decision feedback equalizer; and 

a decoder system for computing the distance of a received symbolic word 
fi:om a codeword. 

61. The integrated circuit conmiunication device according to claim 60, the 
single state decision feedback equalizer having a set of ordered coefScients, the decision 
feedback equalizer defining a coefficient related tail value and a low order subset of 
coefficient values 



62. The integrated circuit communication device according to claim 6 1 , 
wherein the single state decision feedback equalizer has a width dimmsion D, wherein the 
width dimension D corresponds to the number of pairs defining the multi-pair 
transmission charmeL 

63, The integrated circuit communication device according to claim 62, fiirther 
conrprising a state multiplication circuit, the state multiplication circuit ejqjanding a single 
state representation output signal received fiom the single state decision feedback 
equalizer into an N state representation signal suitable for decoding by the DFSE. 
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64. The integrated circuit communication device according to claim 60, 
configured to receive infonnation mcoded in accordance with a multi-level symbolic 
scheme and over a multi-dim^ional transmission channel, the decoder system 
conqirising: 

an input, coupled to receive an input signal; 

a first slicer, coupled to detect the input signal with respect to a first one of 
two disjoint one-dimensional symbol-subsets; and 

a second slicer, coupled to detect the ifq5ut signal with respect to a second 
one of the two disjoint one-dimensional symbol-subsets; 

wherein the first slicer outputs a first decision term and a first error term 
wifli respect to the first one of the two disjoint one-dimensional symbol-subsets, the 
second slicer outputting a second decision term and a second error term with respect to the 
second one of the two disjoint one-dimensional symbol-subsets; and 

wherein each of the first and second error terms is e3q)ressed by a digital 
representation having substantially fewer bits than the input signal. 

65. The symbol decoder according to claim 64, wherein each of the first and 
second error ieams repres^ts a distance metric between the input signal and a symbol in 
the respective one of the two disjoint one-dimensional symbol-subsets. 

66. The integrated circuit communication device according to claim 60, 
configured to receive infonnation encoded in accordance with a multi-level symbolic 
scheme and over a multi-dimensional transmission channel, the decoder system 
comprising: 

an ii^iut to receive an input signal; 

a first slicer coupled to the input, the first slicer detecting the input signal 
with respect to a first one of two disjoint one-dimensional symbol-subsets; 

a second slicer coupled to the mpnU the second slicCT detecting the input 
signal with respect to a second one of the two disjoint one-dimensional symbol-subsets; 
and 

a third slicer coiq)led to detect the input signal with respect to a union set of 
file two disjoint one-dimensional symbol-subsets. 

67. The integrated circuit communication device according to claim 66, 
xisiiCTein the first slicer outputs a first decision with respect to the first one of the two 
disjoint one-dimensional symbol-subsets, the second slicer ouputting a second decision 
with respect to flie second one of the two disjoint one-dimensional symbol-subsets, and 
M^ierein the third slicer outputs a third decision with respect to the union set of the two 
disjoint one-dimmsional symbol-subsets. 
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68. Hie integrated circuit communication device according to claim 67, fiirther 
conq)iising: 

a first combination logic block configiued to combine the first decision 
with the third decision, the first combination logic block defining a first error temi; and 

a second combination logic block configured to combine the second 
decision with the third decidon, the second combination logic block defining a second 
error tmn. 

69. The integrated circuit communication device according to claim 68, fiirther 
comprising: 

a first square error generation block configured to operate on the first error 
term so as to define a square error representation thereof; and 

a second square error generation block configured to operate on die second 
mor term so as to define a square error representation thereof, 

70. The integrated circuit communication device according to claim 69, 
wherein each of the error terms is expressed as a digital representation having one bit 

71 . An integrated circuit communication device configured for operation over a 
multi-pair transmission channel, the communication device comprising: 

a decision feedback sequence estimation (DFSE) circuit, for decoding an 
iiq)ut sample into a final decision corresponding to a codeword of a trellis code having N 
states, die DFSE including a single state decision feedback equalizer, 

a first ISI compensation circuit configured to compensate for a transmitter 
induced ISI component; 

a second ISI con^>rasation circuit configured to compensate for a 
transmission channel induced ISI component; and 

ad£^tive circuitry for reducing power consumption of a filter, the filter 
having an initial set of active coefficients, an input and an ou^ut, the active coefficients 
being ordered, a lowest ordered active coefficient of the initial set being proximal to the 
input, each of the active coefficients having a stable value. 

72. The integrated circuit commimication device according to claim 71, further 
comprising: 

a threshold module generating a thr^hold; 

a comparing module coupled to the tiireshold module, tiie comparing 
module conq)aring an active coefficient with the threshold; and 

a decision module coupled to the comparing module, the decision module 
deactivating the active coefficient according to a cntcnoxL 
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73. The integrated circuit coxnmimication device according to claim 72, 
wherein the decision module deactivates the active coeflBcient if the active coefiBcient has 
a value small«- dian the threshold. 



74. The integrated circuit coimnunication device according to claim 73, further 
comprising: 

a buffer providing a specified error, 
an error computing module computing a error metric; and 
a second comparing module coi^led to the buffer, the error computing 
module and the threshold module, the second comparing module comparing the error 
metric with the specified error and produciug a first control signal to the threshold module 
when the error metric is smaller tfian the specified oror and a second control signal to the 
direshold module when the error metric is larger than the specified error. 

75. The integrated circuit communication device according to claim 74, 
wherein flie threshold module updates the threshold upon reception of the first or second 
control signal. 



76. The integrated circuit communication device according to claim 70, 
wherein the first ISI compensation circuit con^jrises: 

an inverse partial response filter having an impulse response substantially 
an inverse of an impulse response of a pulse shaping filter of a remote transmitter, so as to 
substantially compensate an input digital signal for a first ISI component 

77. The integrated circuit commxmication device according to claim 76, 
whCTein the inverse partial response filter is implemented with a characteristic feedback 
gam factor K, 

78. The integrated circuit communication device according to claim 77, 
A^erem flie inverse partial response filter opiates in accordance with a non-zero value of 
the diaracteristic feedback gain fiictor K during communication initialization and wherein 
the value of the feedback gain factor K is ramped down to zero after a pre-defined interval. 

79. The integrated circuit communication device according to claim 78, 
wherein the second ISI coixq>CTsation circuit comprises: 

a Viterbi decoder configured to decode a digital signal and generate 
tentative decisions; and 

feedback equalizer circuitry coupled to the Viterbi decoder, the feedback 
equalizer circuitry receiving the tentative decisions and combining the teatative decisions 
with a set of high-ordered coefficients to genCTate a first value. 
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80. The integrated circuit communication device according to claim 79, 
wherein the second ISI compensation circuit further comprises: 

siunming drcuiby combining the first value with a second digital signal, 
the summing circuitry ou^utting an intennediate signal; and 

a multiple decision feedback equalizer receiving the intermediate signal and 
combining the intomediate signal with a set of pre-computed values generated by 
combining values of a set of low-ordered coefficients with a set of values rq^resenting 
levels of a multi-level symbolic alphabet to produce a set of potential digital signals, one 
of the potential digital signals being substantially ISI compensated, the multiple decision 
feedback equalizer outputting said one of the potential digital signals to the Viterbi 
decoder. 

8 1 . The integrated circuit communication device according to claim 80, 
wherein the characteristic feedback gain fector K is ramped to zero after convergence of 
the decision feedback equalizer. 

82. The integrated circuit commxmication device according to claim 70, the 
single state decision feedback equalizer having a set of ordered coefficients, the decision 
feedback equalizer defining a coefficient related tail value and a low order subset of 
coefficient values 

83. The integrated circuit communication device according to claim 82, 
wherein the single state decision feedback equalizer has a width dimension D, wherein the 
width dimension D corresponds to the number of pairs defining the multi-pair 
transmission channel. 

84. Tlic integrated circuit conunuiucation device according to claim 83, further 
comprising a state multiplication circuit, the state multiplication circuit expanding a single 
state Tq>resentation output signal received from the single state decision feedback 
equalizer into an N state representation signal suitable for decoding by the DFSE. 

85. The integrated circuit communication device according to claim 84, the 
state multiplication circuit con^)ri6ing: 

a convolution engine coupled to combine the low order subset of 
coefficioit values with each mmxher of a set of symbolic levels to define a first sample 
signal set; and 

a summing circuit coupled to combine the tail value with each member of 
the first sample signal set to define an N state rq)resentational set of signal samples. 
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86. An integrated circuit communication device configured for operation ova- a 
multi-pair transmission channel, the communication device comprising: 

measurement circuitry configured to measure a p>erformance degradation 
characteristic resulting 6om disabling each manbo- of a set of sub-pluralities of a plurality 
of circuit elements; 

disabling circuitry configured to ad^tively disable one or more of the sub- 
pluraUties of the circuit elements untU the perfomiance degradation characteristic reaches 
a ttueshold level; 

a single state decision feedback equalizer; 

a first ISI con^>ensation circuit configured to compensate for a transmitter 
induced ISI con^onent; 

a second ISI compensation circuit configured to compensate for a 
transmission charmel induced ISI component; and 

a decoder system for computing the distance of a received symbolic word 
fiom a codeword* 

87. A metiiod for providing an irqjut signal to a symbol decoder, the irqjut 
signal being based on a tail value and a subset of coeflacient values received fiom a 
decision feedback equalizer, the method comprising flie operations oft 

(a) genoBting a set of pre-computed values in a multiple decision 
feedback equalizer, the pre-computed values bemg based on the subset of coefficient 
values; 

(b) combining each ofthe pre-computed values with the tail value to 
generate a tentative sample; and 

(c) selecting one of the tentative samples as the input signal to the 
symbol decoder. 

88. The method of claim 87 fiirther comprising the operations of: 

(d) storing the tratative samples in a set of registers; and 

(e) outputting the stored tentative samples to a multiplexer, the 
multiplexer selecting one of the taitative sanq>les as the input signal to the symbol 
decodo", 

wherein the operation of storing the tentative samples in the registers before 
providing the tentative sanqjles to the multiplexer facilitates higji-speed operation by 
breaking up a oitical pafli of computations into substantially balanced first and second 
portions, tire first portion including conqmtations in the decision-feedbadc equalizer and 
the multiple decision feedback equalizer, the second portion including computations in the 
sjmibol decode. 
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89. The method of claim 88 fiirthCT including the operations of saturating the 
tmtative samples and providing the saturated tentative samples to the registers. 

90. The method of claim 87 \^erein tfie decision-feedback equalizer conq^rises 
a set of ordered coefficients and the subset of coefficient values comprises values of first 
two lowest ordered coefficients of the decision-feedback equalizer. 

91. The method of claim 87 herein the prc-computed values are based on the 
subset of coefficient values and values representing levels of a multi-level a^habet 

92. The method of claim 87 ^^terein in operation (c) said one of the tentative 
samples is selected based on a symbol received &om the symbol decoder. 

93. A multiple decision feedback equaliza* for providing an input signal to a 
decoder, the multiple decision feedback equalizer receiving a tail value and a subset of 
coefficient values fiom a decision-fisedback equalizer, the multiple decision feedback 
equalizer comprising: 

(a) aconqiutingmoduleforgeneratingaset of pre-computed values 
based on the subset of coefficient values; 

(b) a set of adders coupled to the confuting module, the adders 
corresponding one-to-one to the pre-computed values, each of the adders combining the 
corresponding prc-computed value with the tail value to generate a tentative sample; and 

(c) a multtplexo* coiq>led to the adders, the multiplexer receiving the 
tentative san:q)les, selecting and outputting one of the tentative samples as the input signal 
to the decoder. 

94. The multiple decision feedback equalizer of claim 93 further comprising: 

(d) a set of registers coupled to the set of adders to receive the tentative 
samples, the registers storing the tentative samples and providing ttie tentative samples to 
the multiplex^ 

Mli^ein the locations of the registers facilitate high-speed operation by breaking up a critical 
path of computations into substantially balanced first and second portions, the first portion 
including computations in the decision-feedback equalizer and the midtiple decision feedback 
equalizer, the second portion including computations in the decoder. 

95. The multiple decision feedback equalize of claim 94 furdiCT including a set 
of saturators coiq)led to the set of adders, the saturators saturating the tentative sanq>les and 
providing the saturated tentative samples to the regist^s. 
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96. The multiple decision feedback equalizer of claim 93 wherein the decision- 
feedback equalizer has a set of ordered coefficients and the subset of coefficient values 
comprises values of first two lowest ordered coefficients of the decision-feedback equalizer. 

97. The multiple decision feedback equalizer of claim 93 wherein the pre- 
computed values are based on the subset of coefficient values and known symbol values. 

98 . The multiple decision feedback equalizer of claim 93 wherein the multiplexer 
selects said one of the tentative samples based on a symbol received bom the decoder. 

99. A metiiod for decoding an input sample into a final decision corresponding to 
a codeword of a treUis code having N states, tixe method comprising the operations of: 

(a) decoding a set of signal samples to generate toitative decisions and the 
final decision using a decoder block; 

(b) providing the tentative decisions to a decision-feedback equaUzer, the 
decision feedback equalizo- having a set of low-ordered coefficients and a set ofhigh-ordered 
coefficients; 

(c) generating from the decision-feedback equalizer a tail value based on 
the tentative decisions, values of the high-ordered coefficioits and the input sample; 

(d) providing the ttdl value and values of the low-ordered coefficients to 
a miiltiple decision feedback equalizer; and 

(e) generating from the multiple decision feedback equaUzer the set of 
signal samples for the decoder. 

100. The method of claim 99 wherein operation (a) con^rises: 

(1) providing the set of signal samples to a Viterbi decoder, 

(2) computing path metrics for each of theN states of the trellis code based 
on the set of signal samples; 

(3) computing decisions correspoiiding to the N states based on the path 

metrics; and 

(4) providing the decisions to a path memory module, the path memory 
module having a mmiber of dq>th levels corresponding to consecutive time instants, each of 
the dqith levels having N regist^s corresponding to the N states of the trellis cckie; 

(5) iq)dating decisions stored in ttie registers of all depth levels; and 

(6) selecting a best decision at each of selected depth levels, the best 
decision at tiie last depth level being the final decision, tiie best decisions at other selected 
d^tii levels being the tentative decisions. 

101. The method of claim 99 wherein the input sanq)le is an L-dimraisional sample. 
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102. The method of claim 99 wherein each of the tentative decisions and the final 
decision corresponds to a codeword of the trelUs code. 

103. The method of claim 99 Miierein the decision-feedback equalizer includes a 
delay line and wherein operation (b) con^rises the operation of inputting the tentative 
decisions into the decision-feedback equalizer at various locations of the delay line. 

104. The method of claim 100 wherein operation (f) comprises the operation of 
selecting the best decisions at tiie first three depth levels of the path memory module as the 
tentative decisions. 

105. Themethodofclaim99 wherein the set of low-ordered coeflScients comprises 
the first two coefficients of the decision-feedback equalizer. 

106. The methiod of claim 99 wherein operation (e) comprises the op^tions of: 

( 1 ) generating a set of pre-conqjuted values based on the values of the low- 
ordeiied coefficients; 

(2) combining each of the pre-computed values with the tail value to 
generate a tentative sanq)le using a set of adders; and 

(3) selecting one of the tentative samples as one of the signal samples 
using one of N multiplexa:5 corresponding to the N states. 

107. The method of claim 106 wherein operation (e) fiuther comprises the 
operations of: 

(4) storing the tentative samples in a set of registers; and 

(5) outputting the stored tentative samples to the multiplexer ; 
wdierein ttie operation of storing the tentative samples in the registers before 

providing the tmtative samples to the multiplexer facilitates high-speed operation by breaking 
up a critical path of computations into substantially balanced first and second portions, the 
first portion including conq>utations in ttie decision-feedback equalizer and the multiple 
decision feedback equalizer, the second portion including conq)utations in the decoder. 

108. The method of claim 107 wherein the multiple decision feedback equalizer 
fiirther comprises a set of saturators coupled to the set of adders to receive the t^tafive 
samples, the saturators saturating the tentative samples and providing the saturated tentative 
Kinq>les to the registers. 

109. Themethod of claim 106 wherein the pre-computed values are computed based 
on values of flic set of low-ordered coefficimt and known symbol values. 
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1 10. The method of claim 106 wha:iefai, m operation (3), for each of the N states, 
one of the received tentative samples is selected by the corresponding multiplexer based on 
at least one decision ou^utted fiom the decoder. 

111. A decision-feedback sequence estimation block for decoding an input sample 
into a final decision corresponding to a codeword of a trellis code having N states, the 
decision-feedback sequence estimation block comprising: 

(a) a decoder block decoding a set of signal samples to generate tentative 
decisions and the final decision; 

(b) a decision-feedback equalize coupled to the decoder block for 
receiving the tentative decisions, the decision feedback equalizer having a set of low-ordered 
coefficients and a set ofhigh-ordered coefficients, the decision-feedback equaUzer generating 
a tail value based on the tentative decisions, values of the high-ordered coefficients and the 
input sample; and 

(c) a midtiple decision feedback equalizer coupled to the decision-feedback 
equalizer for receiving the tail value and values of the low-ordered coefficients, the multiple 
decision feedback equalizer generating the set of signal samples for the decoder block. 

1 12. The decision-feedback sequence estimation block of claim 111 wherein the 
decodo: block comprises: 

(1) a Viterbi decoder for receiving the set of signal samples, the ViteAi 
decoder computing path metrics for each of the N states of the trellis code and outputs 
decisions based on the path metrics; and 

(2) apathmemory modulecoupledto the VitCTbi decoderforreceivingthe 
decisions, the path memory module having a munber of depth levels corresponding to 
consecutive time instants, each of the depth levels including N registers for storing decisions 
corresponding to the N states, each of selected dq)th levels including a multiplexer for 
selecting a best decision from corresponding N registers, the best decision at the last depth 
level being the final decision, the best decisions at other selected depth levels being the 
tentative decisions. 

1 13. The decision-feedback sequence estimation block of claim 111 wherein the 
input sample is an L-dimensional sanq)le. 

114. Ttie decision-feedback sequoice estimation block of claim 111 whensin each 
of the tentative decisions corresponds to a codeword of the treUis code. 

115. The decision-feedback sequence estimation block of claim 111 wherein the 
decision-feedback equalize includes a delay line and the tentative decisions are ir^utted into 
the decision-feedback equalizer at various locations of the delay line. 
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116. The decision-feedback sequmce estimation block of claim 1 12 wherein the 
tentative decisions are generated fiom the first three depth levels of the path memory module. 

117. The decision-feedback sequence estimation block of claim 111 \Jv1iereinthe set 
of low-ordered coefficients conqmses the first two coefficients of the decision-feedback 
equalizer. 

118. The decision-feedback sequence estimation block of claim 111 wherein the 
multiple decision feedback equalizer comprises: 

(1) a con:q)uting module for generating a set of pre-computed values based 
on the values of tiie low-ordered coefficients; 

(2) a set of add^ coiqiled to the conq>uting module, the add^ 
corresponding one-to-one to the pre-computed values, each of the adders combining the 
corresponding pre-computed value with the tail value to genraate a tentative sample; and 

(3) N multiplexers corresponding to the N states of the trellis code, each 
of the N multiplexers being coupled to the adders for receiving the tentative samples, each of 
the N multiplexers selecting and outputting one of the received tentative samples as one of 
the signal samples to the decode block. 

119. The decision*feedback sequence estimation block of claim 118 wdierein the 
multiple decision feedback equalizer fiarth^- comprises: 

a set of registers coupled to the set of adders to receive the tentative samples, 
flic registers being located between the adders and each of the N multiplexers, the registCTS 
providing the tentative sanqiles to the multiplexers; and 

\^erein the locations of the registers facilitate high-speed operation by 
breaking up a critical pafli of computations into substantially balanced first and second 
portions, the first portion including computations in the decision-feedback equalize and the 
multiple decision feedback equalizer, the second portion including computations in the 
decoder blodc 

120. The decision-feedback sequence estimation block of claim 1 19 wherem the 
multq)le decision feedback equalizer fiirther comprises a set of saturators cotq)led to the set 
of adders to receive the tentative san:q}les, the saturators saturating the tentative samples and 
providing the saturated tmtative samples to the registers. 

121 . The decision-feedback sequence estimation block of claim 118 wherein the 
pre-computed values are based on values of the set of low-ordered coefficimts and known 
symbol values. 
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1 22. The decision-fieedback sequence estimation blodc of claim 118 wherein each 
of the N multiplexers selects one of the received tentative samples based on at least one 
decision received fix>m the decoder block. 

123. A receiver for demodulating a received signal, the received signal resulting 
&om information signals being encoded in accordance with a multi-state ^coding scheme 
and modulated in accordance widi a multi-level modulation alphabet before transmission, the 
receiver comprising: 

a symbol decoder receiving a set of ISI compensated signal sanq>ies 
representing the received signal and correspondmg to the plurahty of states of the encoding 
schone, the symbol decoder evaluating the signal samples in accordance with the multi-level 
modulation alphabet, decoding die evaluated signal sanq>les in accordance with the multi-state 
encoding scheme, and outputting tentative decisions; and 

an ISI conqjensation circuit cotq>led to the symbol decoder to receive the 
tentative decisions, the ISI compensation cncuit comprising a single decision feedback 
equalizer, the single decision feedback equalizer providing an ISI compCTsation to the 
received signal based on tentative decisions and providing the set of ISI comprasated signal 
samples to the symbol decoder. 

124. The receiver according to claim 123, wherein the symbol decoder corr^rises: 
a symbol detector receiving the set of ISI compensated signal sanq^les, 

computing path metrics for each of the states of the multi-state oicoding scheme and 
outputting decisions based on the path metrics; 

a patii memory module coupled to the symbol detector to receive decisions, 
the path memory module including a plurality of sequential registers arranged in sets such that 
each of the sets of registers corresponding to a respective one of consecutive time intervals 
and, in each of the sets of regist^, the registers storing decisions corresponding to respective 
ones of the states of the multi-state encoding scheme; and 

a decision module selecting a best decision fiom a corresponding set of the sets 
of registois, the best decision from a distal set of the sets of registQ:s defining a final decision, 
tiie best decision fi^m an intermediate set of die sets of regist^ defining a tmtative decision. 

125. The receiver according to claim 124, wfaoiein the single decision feedback 
equalizer receives a selected set of tentative decisions fiom the path monoiy module, die 
decision feedback equalizer having a set of low-ordered coefficients and a set ofhig}i*ordered 
coefficients, the decision feedback equalize genemting a first ISI component based on the 
selected set of tentative decisions and the set of high-ordered coefficients. 

126. The lecdvo* according to claim 125, \>^erein the ISI conqj^isation circuit 
fintfaer conqnising a multiple decision feedback equalize, including a convolution engine. 
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coiQ>led between the decision feedback equalizer and the symbol decoder, the convolution 
engme receiving low-oidwed coefficients ftom the decision feedback equalizer and combining 
the low-ordered coefficients with values corresponding to levels of the multi-level modulation 
alphabet to produce a second ISI component 

127. The receiver according to claim 126, wherem the ISI compensation dicuit 
combines the first and second ISI components with the received signal to generate the ISI 
compensated signal sanq)les* 

128. The receiver according to claim 123 wherein the receiver is implemcaited as 
an integrated circuit 

129. A method for ISI conq)ensating and decoding a received signal, the received 
signal resulting fiom information signals being mcoded m accordance wifli a multi-state 
encoding scheme and modulated in accordance with a multi-level modulation alphabet before 
transmission, the method comprising the operations of: 

receiving a set of ISI compensated signal samples represcntmg the received 
signal with ISI substantially cancelled; 

evaluating the ISI conq)ensated signal san^les m accordance with the multi- 
level modulation alphabet and with the multi-state encoding scheme in a symbol decoder, 

outputting tentative decisions firom the symbol decoder; 

generating the ISI compensated signal samples fit>m the received signal and 
the toitative decisions using a single decision feedbadc equalizer; and 

providing the ISI conq>msated signal samples to the symbol decoder, 

130. The method according to claim 129, wherein the single decision feedback 
equalize mcludes a set of low-ordered coefficients and a set ofhigh-ordered coefficients, and 
wherein the operation of gmo^ting the ISI compensated signal samples comprises: 

combining the tentative decisions with values of the set of high-ordered 
coefficioits to produce a partial ISI con:q>oncnt; and 

combining the partial ISI con^jonent with die received signal to generate a 
partially ISI compensated intermediate signal 

131. The method according to claim 130, \^a:ein the operation of generating the 
ISI coxxspmsdted signal sanq>les further comprises: 

providing values of flie low-ordered coefficients to a multiple decision 
feedback equalize; 

combining the values of the low-ordered coefficients with a set of values 
rqiresmting levels of the multi-level modulation alphabet to generate a plurality of 
convolution result tmns; and 
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combining the partially ISI compensated intermediate signal with each of the 
plurality of convolution result tenns to produce a set of potential signal samples for each of 
a plurality of states of the multi-state encoding scheme, the sets of potential signal samples 
corresponding one-to-oneto the states of the multi-state encoding scheme. 

132. The mc^od according to claim 13 1, wherein the op^on of gaieiating the 
ISI compensated signal sanq>les further comprises: 

selecting for each of the states of the multi-state encoding scheme one of the 
potential signal samples included in a correspondmg set of potential signal samples, the 
selected potential signal samples being the ISI conq)ensated signal samples. 

133. The method according to claim 132, wherein the operation of evaluating the 
ISI conqiensated signal samples comprises: 

slicing the ISI compensated signal samples in accordance with fte multi-level 
modulation alphabet; 

computing path metrics for each of the states of the multi-state encoding 

schone; 

computing decisions corresponding to each of tiie states of ttie multi-state 
encoding scheme based on the path metrics; 

storing the decisions in a path memory module, the path memory module 
havinganumberofdepthlcvels correspondmg to consecutive time intravals, each of the dq)th 
levels having a number of registers corresponding to the number of states of the multi-state 
encoding scheme; 

updating decisions stored in tiie registers of all depth levels; and 

selecting a best decision at each of selected dq)th levels, flie best decision at 

the last dq)th level defining a final decision, the best decisions at othw selected dq)fli levels 

defining tentative decisions. 

134. The m^od according to claim 133, vrfierein the multi-state encoding scheme 
is a treUis code. 

135. The method according to claim 134, wherein the multi-level modulation 
a^)habet corresponds to a five-level Pulse AmpUtude Modulation consteUation. 

136. The method according to claim 135, wherein the method is performed in a 
bidirectional communication systan transmitting and receiving signals over an Ethernet 
commuiiication channel, flie communication diannel being charactaized by fom^twisted wire 
pairs of cable, disposed in parallel configuration, each of flie four twisted wire pairs of cable 
tianq>Qiting signals. 
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137. A method for computing a distance of a received word from a codeword, Ae 
codeword being a concatenation of L symbols selected fix)m two disjoint symbol subsets X 
and Y, the codeword being included in one of a plurality of code-subsets, the received word 
being represented by L inputs, each of the L inputs imiquely corresponding to one of L 
dimensions, the method conr^rising the operations of: 

(a) producing a set of one-dimensional errors from the L ii^uts, each of 
the one-dimensional errors representing a distance metric between one of the L inputs and a 
symbol in one of the two disjoint symbol-subsets; and 

(b) combining the one-dimensional errors to produce a set ofL-dimensional 
errors such that each of the L-dimensional errors is a distance of the received word from a 
nearest codeword in one of the code-subsets. 

138. The method of claim 137 wherein each of the one-dimensional errors is 
represented by substantially fewer bits Hbsai each of the L inputs. 

139. The method of claim 137 wherein operation (a) comprises the operation of 
slicing each of the L inputs with respect to eadi of the two disjoint symbol-subsets X and Y 
to produce a set of X-based errors, a set of Y-based errors and corresponding sets of X-based 
and Y-based decisions, the sets of X-based and Y-based errors forming the set of one- 
dimensional errors, the sets of X-based and Yrbased decisions forming the set of one- 
dimensional decisions, each of the X-based and Y-based decisions being a symbol in a 
corresponding symbol-subset closest in distance to one of the L inputs, each of the one- 
dimensional errors rq)resenting a distance metric between a corresponding one-dimensional 
decision and one of the L inputs. 

140. The method of claim 139 wherein each of the one-dimensional errors is 
represented by 3 bits* 

141. The method of claim 139 wherein the operation of slicing is performed via a 
look-up table. 

142. The method of claim 141 wherein the look-up table is implemented using a 
read-ordy-memory storage device. 

143. The method of claim 141 wherein the look-up table is inq)lemented using a 
random-logic device. 

144. The method of claim 137 wherein oporation (a) comprises the operation of: 
(1) slicing each of the L inputs with respect to each of the two disjoint 

symbol-subsets X and Y to produce a set of X-based decisions and a set of Y-based decisions. 
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the sets of X-based and Y-based decisions forming the set of one-dimensional decisions, each 
of the X-based and Y-based decisions being a symbol in a corresponding symbol-subset 
closest in distance to one of the L inputs; 

^ (2) slicing each of the L iiq}Uts with respect to a symbol-set comprising all 

symbols of the two disjoint symbol-subsets to produce a set of hard decisions; and 

(3) combining eadi of the sets of X-based and Y-based decisions wifli the 
set ofhard decisions to produce the set of one-dimensional errors, each of the one-dimensional 
errors represmting a distance metric between the con^onding one-dimensional decision and 

jQ one of the L inputs. 

145. The metfiod of claim 144 wherein operations (1), (2) and (3) arc performed via 
a look-iq) table. 

J 5 146, The method of claim 145 wherein the look-up table is inq)lemented using a 

read-only-memory storage device. 

147. The method of claim 145 wherein the look-up table is implemented using a 
random-logic device. 

20 

148. The method of claim 144 wherein each of the one-dimensional errors is 
represented by one bit 

149. The method of claim 137 whwein operation (b) con^jrises the operations of: 
25 combining the one-dimensional errors to produce two-dimensional errors; 

combining the two-dimensional errors to produce intermediate L-dimensional 

errors; 

arranging the intermediate L-dimensional errors into pairs of errors such that 
the pairs of errors correspond one-to-one to the code-subsets; and 

determining a minimum for each of the pairs of mors, the minima being the 
L-dim^isional errors. 



150. The method of claim 137 wherein L is equal to 4. 

15 L The method of claim 137 whCTein the plurality of code-subsets conq>rises 2*"' 
code-subsets. 

1 52. The method of claim 151 wherein the set of one-dimmsional errors comprises 
2L one-dimcQsional errors. 
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153. The me&od of claim 152 wherein the set of Iwlimensional errors comprises 
2^' L-dimensional errors. 

1 54. The method of claim 1 53 wherein operation (b) comprises the operations of: 
combining the 2L one-dimensional mors to produce 2L two-dimensional 

errors; 

combining the 2L two-dimraisional errors to produce the 2^ intermediate L- 
dimensional errors; 

arranging the 2^ intermediate L-dim^ional errors into 2^^ pairs of errors such 
that the 2^^ pairs of errors correspond one-to-one to the 2^^ code-subs^; and 

determining a minimum for each of the 2^* pairs of errors, the minima being 
the 2^' L-dimensional errors. 

1 55. A system for computing a distance of a received word ftom a codeworxi, the 
codeword being a concatenation of L symbols selected fix)m two disjoint symbol-subsets X 
and Y, the codeword being included in one of a pluraUty of code-subsets, the received woni 
being represented by L inputs, each of the L ioputs uniquely corresponding to one of L 
dimensions, the system comprising: 

(a) a set of slicers for producing a set of one-dimensional errors from the 
L inputs, each of the one-dimoisional orors representing a distance metric between one of tiie 
L-inputs and a symbol in one of the two disjoint symbol-subsets; and 

(b) a combining module for combining the one-dimensional errors to 
produce a set of L-dimensional errors such that each of the L-dimensional errors is a distance 
of the received word from a nearest codeword in one of the code-subsets. 

156. The system of claim 155 wherein eadi of the one-dimensional errors is 
rqiresented by substantially fewer bits than each of the L iiq>uts. 

157. The systan of claim 155 whwcin the slices slice the L inputs with respect to 
eadi of the two disjoint symbol-subsets X and Y to produce a set of X-based errors, a set of 
Y-based errors and corresponding sets of X-based and Y-based decisions, the sets of X-based 
and Y-based errors forming the set of one-dimensional enors. the sets ofX-based and Y-based 
decisions forming the set of one-dimensional decisions, each of the X-based and Y-based 
decisions being a symbol in a corresponding symbol-subset closest in distance to one of the 
L inputs, each of the one-dimrasional mors representing a distance metric between a 
corresponding one-dimensional decision and one of tihe L ir^uts. 

158. The system of claim 157 wherein each of the one-dhnensional mors is 
rq)resented by 3 bits. 
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159- The system of claim 157 wherein the slicers are implemented using a look-up 

table. 

160. The system of claim 159 wh^in the look-up table is in^ilOTiented using a 
read-only-memory storage device. 

161. The system of claim 159 wherein the look-iq) table is implemented using a 
random-logic device. 

162. The system of claim 155 wherein the set of slicers comprises: 

(1) first slicers for slicing each of the L inputs with respect to each of the 
two disjoint symbol-subsets X and Y to produce a set of X-based decisions and a set of Y- 
based decisions, the sets ofX-based and Y-based decisions fi>nning the set of one-dimensional 
decisions, each of the X-based and Y-based decisions bdng a symbol in a corresponding 
symbol-subset closest in distance to one of the L inputs; 

(2) second slicers for slicing each of the L inputs with respect to a symbol- 
set comprising all symbols of the two disjoint symbol-subsets to produce a set of hard 
decisions; and 

(3) OTor-computing modules for combining each of the sets ofX-based and 
Y-based decisions with the set of hard decisions to produce the set of one-dimensional mors, 
each of the one-dimensional errors representing a distance metric between the corresponding 
one-^dimensional decision and one of the L inputs. 

163. The system of claim 162 wherem the first and second slicers and the error 
computing modules are implemented using a look-up table. 

164. The system of claim 163 wherein the look-up table is implemented using a 
read-only-memory storage device. 

165. The system of claim 163 wherein the look-iq) table is implemented using a 
random-logic device. 

166. Itie system of claim 162 wherein each of the one-dimmsional errors is 
rq>resCTt6d by one bit 

1 67. The system of claim 155 whraein the combining module comprises: 

a first S€i of adders for combining the one-dimensional errors to produce two- 
dimensional enors; . 
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a second set of adders for combining the two-dimensional errors to produce 
intermediate L-dimensional errors, the intermediate L-dimensional mors being airanged into 
pairs of errors such that the pairs of errors correspond one-to-one to the code-subsets; and 

a minimum-select module for determining a minimum for each of the pairs of 
errors, the minima being tiie L-dimensional errors. 

168. The system of claim 155 wherein L is equal to 4, 

169. The system of claim 155 wherein the plurality of code-subsets comprises 2*^' 
code-subsets. 

170. The s)r5tem of claim 169 wherein the set of one-dimensional errors comprises 
2L one-dimensional errors. 

171 . The system of claim 170 herein the set ofL-dimensional errors coirqirises 2^' 
L-dimensional errors. 

172. The system of claim 171 wherein tiie combining module comprises: 

a first set of addos for combining the 2L one-dimensional errors to produce 
2L two-dimensional errors; 

a second set of adders for combining die 2L two-dimensional errors to produce 
the intermediate L-dimensional errors, the 2^ intermediate L-dimensional errors being 
arranged into 2^' pairs of errors such that the 2^' pairs of errors correspond one-to-one to the 
2^* code-subsets; and 

a minimum-select module for determining a minimum for each of the 2^ * pairs 
of errors, the minima being the 2^* L-dimensional errors. 

173. The system of claim 155 wherein the system is included in a communication 
transceiverconfigured to transmit andreceiveinfonnationsignals encoded in accordance with 
a multi-level symbolic scheme. 

174. A method for computing a distance of a received word fix)m a codeword, the 
codeword being a concatenation of L symbols selected fiom two disjoint symbol-subsets, the 
codeword being included in one of 2*"' code-subsets, the received word being represented by 
2^^ input sets, each of the 2^' iiq)ut sets having L irq)uts, each of the L inputs imiquely 
corresponding to one of L dimensions, each of the 2^' iiq)ut sets corresponding to one of the 
2^* code-subsets, the mefliod comprising ttie operations of: 

(a) slicing each of the L irq)uts of eadi of the 2^' input sets with respect 
to each of the two disjoint symbol-subsets to produce an error set of 2L one-dimensional 
errors for each of the 2^' code-subsets; and 
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(b) combining one-dimensional errors within each of the error sets to 
produce 2" L-dimensional mors for the corresponding code-subset such that each of the 2^^ 
L-dimensional errors is a distance of the received word from one of codewords, 

1 75. The method of claim 1 74 wherein L is equal to 4, 

176. The method of claim 174 wherein operation (b) comprises the operations of: 
combining the 2L one-dimensional errors to produce 2L two-dimensional 

errors; 

combining the 2L two-dimensional errors to produce a set of 2*- intermediate 
L-dimensional errors; 

arranging the 2^ intermediate L-dimensional errors into 2^* pairs of errors such 
that the 2^^ pairs of errors correspond one-to-one to the 2^' code-subsets; and 

determining a minimum for each of the 2^* pans, the minima bemg the 2^' Lr 
dimensional errors. 

177. The method of claim 176 \dierein operation (a) comprises the operation of 
producing a decision set of 2L onie-dimoisional decisions for each of the 2^^ code-subsets. 

178. The method of claim 176 wherein operation (b) comprises the operation of 
combining one-dimensional decisions within each of the decision sets to produce 2^^ 
dimoisional decisions for the corresponding code-subset such that each of the 2" L- 
dimsisional decisions is a codeword closest in distance to the received word, the codeword 
being in one of 2^^ code-subsets included in the 2^* code-subsets. 

1 79 . The method of claim 1 74 wherein the method is performed in a communication 
transceiver configured to transmit and receive information signals encoded in accordance with 
a multi-level symbolic schmie. 

180* A symbol decoder included in a receiver configured to receive information 
encoded in accordance with a multi-level symbolic scheme and ovct a multi-dimensional 
transmission channel, the symbol decode comprising: 

an iig)ut, coiipled to receive an ir^ut signal; 

a first slicer, coupled to detect the input signal with respect to a first one of two 
disjoint one-dimensional symbol-subsets; and 

a second sheer, coi^led to detect the irq)ut signal with respect to a second one 
of tite two disjoint one-dimensional symbol-subsets; 

wherein the first slicer outputs a first decision term and a first mor tenn with 
respect to the first one of ttie two disjoint one-dimensional symbol-subsets, the second slicer 
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ou^utting a second decision tenn and a second error term with respect to the second one of 
the two disjoint one-dimensional symbol-subsets; and 

wherein each of the first and second error terms is expressed by a digital 
representation having substantially fewer bits than the irq)ut signal. 

181. The symbol decoder according to claim 180, wherein each of the first and 
second enor terms represents a distance metric between the input signal and a symbol in the 
respective one of the two disjoint one-dimensional symbol-subsets. 

182. The symbol decoder according to claim 181, wherein each of the first and 
second decision terms represents a symbol in the respective one of the two disjoint one- 
diniensional sjonbol-subsets. 

183. The symbol decoder according to claim 180, wherein each of the first and 
second error terms is expressed by a digital representation having fewer than four bits. 

184. The symbol decoder according to claim 180, wherein each of the fir^t and 
second error terms is expressed by a digital representation having fewer than three bits. 

185. The synibol decoder accordiog to claim 182, wherein the multi-dimensional 
transmission channel has L dimensions and wherein the first and second error terms are 1- 
dimensional error terms, the I-dunensional error temis being combined to generate a set of 
L-dimensional error terms such that each of the L-dimensional error terms represents a 
distance between a received L-dimensional word and a nearest codeword. 

186. A symbol decoder included in a receiver configured to receive information 
encode in accordance with a multi-level symbolic scheme and over a multi-dimensional 
transmission channel, the symbol decoder comprising: 

an input to receive an input signal; 

a first sUcer coupled to the iiqjut. the first sheer detecting the input signal with 
respect to a first one of two disjomt one-dimensional symbol-subscte; 

a second sheer coiqjled to the irqiut, the second sheer detecting the input signal 
with respect to a second one of the two diqoint one-dimensional symbol-subsets; and 

a third sheer coupled to detect the input signal with respect to a union set of 
flie two disjoint one-dimensional symbol-subsets. 

187. The symbol decoder according to claim 186, 

v^iierein the first slicer ou^uts a first decision witfi respect to the first one of 
Ae two disjoint one-dimensional symbol-subsets; 
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wherein the second slicer ouputs a second decision with respect to the second 
one of the two disjoint one-dimensional symbol-subsets; and 

wherein the third slicer ouputs a third decision with respect to the miion set 
of the two disjoint one-dimensional symbol-subsets. 

1 88. Ttie symbol decoder according to claim 1 87 further comprismg: 

a first combination logic block configured to combine the first decision with 
the third decision, the first combination logic block defining a first enror tem; and 

a second combination logic block configured to combine the second decision 
with the third decision, the second combination logic block defining a second error icon, 

1 89. The symbol decoder according to claim 1 88 finther comprising: 

a first square error generation block configured to operate on the first error term 
so as to define a square error representation th^eof; and 

a second square error generation block configured to operate on the second 
error term so as to define a square error representation thereof 

190. The symbol decoder according to claim 189, wherein each of the first and 
second combination logic blocks is unplemented using a look-up table, the combination 
operation and error term definition bdng performed by consulting an appropriate entry in the 
look-up table. 

191. The symbol decoder according to claim 190, wherein each the first and second 
square error gencaration blocks is implemcaited using a look-up table, ttie square error 
definition being performed by consulting an appropriate entry in the look-up table. 

192. The symbol decoder according to claim 190, wherein the look-up table is 
inq)l^ented in a read-only-memory storage device. 

193. The symbol decoder according to claim 190, wherein the look-up table is 
inqilCTiiented as a random logic integrated circuit. 

194. The symbol decoder according to claim 190, herein the look-up table is 
impleanented as a programmable-logic-array integrated circuit. 

195. TTie symbol decoder according to claim 190, v4iCTem each of the enx)r terms 
is expressed as a digital representation having one bit 
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1 96. A method for symbol decoding in a receiver configured to receive information 
encoded in accordance with a multi-level symbolic scheme and over a multi-dimensional 
transmission chaimel, flie method comprising: 

pro viding an input for receiving an input signal; 

detecting the input signal with respect to a first one of two disjoint one- 
dimensional symbol-subsets in a first slic^, 

detecting the input signal with respect to a second one of the two disjoint one- 
dimensional symbol-subsets in a second slicer, and 

detecting the input signal with respect to a union set of fee two disjoint one- 
dimensional symbol-subsets in a third sheer. 

197. The method according to claim 196, fiirthCT comprising: 

producing fit>m the first sheer a first decision with respect to the first one of 
the two disjoint oneKiimensional symbol-subsets; 

producing from the second slicer a second decision with respect to tiie second 
one of the two disjoint one-dimensional symbol-subsets; and 

producing bom the third slicer a third decision wifli respect to the union set of 
the two disjoint symbol-subsets. 

198. The method according to claim 197, fiirther comprising: 

combming the first decision with the thurd decision m a first combination block 
so as to define a first error term; and 

combining the second decision with the third decision in a second combination 
block so as to define a second error term. 

1 99- The method according to claim 1 98, fiirther comprising: 

operating on the first error term so as to define a square error representation 

thCTeof in a first square euor block; and 

operating on the second error term so as to define a square error represCTtation 

thaieof in a second square error block. 

200. The method according to claun 199, v^lierein each of the first and second 
combination blocks is implemented using a look-up table, the combination operation and error 
term definition being praformed by consulting an appropriate eatry in the look-iq) table. 

201. The method according to claim 200, wherein each of the first and second 
square error blocks is implemented using a look-iq) table, tiie square error definition being 
performed by consulting an ^ropriate entry m the look-up table. 
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202. The method according to claim 200, wherein the look-up table is implemented 
in a read-only-memory storage device. 

203. The method according to claim 200, wherein the look-iq> table is implemented 
as a random logic integrated circuit. 

204. The metfiod according to claim 200, wherein the look-up table is implemented 
as a programmable-logic-anBy integrated circuit. 

205. The method according to claim 200, wherem each of the first and second error 
terms is expressed as a digital repnesratation having one bit 

206. A receiver for demodulating an analog signal transmitted by a remote 
transmitter over a transmission channel, the analog signal including a first ISI component 
induced by a characteristic of a pulse shaping filter included in the remote transmitter and a 
second ISI component induced by a characteristic of the transmission channel, the receiver 
comprising: 

an analog &ont end, including an analog-to-digital converter, the analog fiont 
end receiving and convating the analog signal to a first digital signal; 

an equalizer block coupled to the analog fixmt end to receive the first digital 
signal, the equalizer block compensating the first ISI component m the first digital signal and 
outputting a second digital signal; and 

a decision feedback sequoice estimation block, including an ISI compensation 
circuit, the ISI compmsation circuit receiving the second digital signal outputted by the 
equalizer block and cpmpmsating the second ISI component in the second digital signal, 

207. Hiereceivo-of claim 206 wherein flie equalizer block comprises: 

an ISI compensation filter having an impulse response substantially inverse of 
the impulse response of the pulse shaping filter of the remote transmittCT; and 
an adaptive gain stage. 

208. The receiver of claim 207 herein the ISI compensation filter is an inverse 
partial response filter. 

209. Hie receiver of clann 208 wherein the equalizer block is configured as a 
feedforward equalizer. 

210. The receiver of claim 206 wherein the decision feedback sequence estimation 
block comprises: 
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a decoder block receiving and decoding at least one ISI compmsated signal 
sample, and gmerating tentative decisions and a final decision; and 

adecision feedback equalizer coiqiled in feedback fashion to the decoder block, 
the decision feedback equalizer including a set of low-ordered coeflScients and a set of high- 
ordered coefficients, the decision feedback equalizer generating a first portion of ISI 
compensation for the second ISI componmt based on the tentative decisions and the high- 
ordered coefficients. 

211. The receiver of claim 210 wherein the decision feedback sequence estimation 
block finther comprises a convolution engine coupled to the decision feedback equalizer to 
receive values of the low-ordered cocfficimts, the convolution engine computing a set of pre- 
computed values representing a set of potential second ISI compensation portions for the 
second ISI component 

21 2. Hie receiver of claim 211 M*erein the second digital signal is combined with 
the first portion of ISI compensation to produce a third digital signal partially compensated 
for the second ISI component. 

213. The receiver of claim 212 wherein the decision feedback sequence estimation 
block fiuther comprises a multiple decision feedback equalizer coupled to the decision 
feedback equalizer and the convolution engine, the multiple decision feedback equalizer 
combining the set of pre-computed values with the third digital signal to produce a set of 
potential digital signals, one of the potential digital signals being substantially compensated 
for the second ISI coiiq>onent. 

214. The receiver of claim 213 wherein the multiple decision feedback equalizer 
includes a multiplexer, the multiplexer receiving the set of potential digital signals, selecting 
one of the potential digital signals based on at least one tentative decision received fiom the 
decoder block, and providing the selected one of the potential digital signals to the decoder 
block as the at least one ISI compensated signal sample. 

215. A method for demodulating an analog signal transmitted by a rraiote 
transmittCT over a transmission channel, the analog signal including a first ISI compon^t 
induced by a diaracteristic of a pulse shying filter included in the remote transmitter and a 
second ISI component induced by a characteristic of the transmission channel, the method 
con^rising tfie operations of: 

(a) receiving die analog signal in an analog fix>nt end; 

(b) conv^ting the analog signal to a first digital signal; 

(c) compensating the first ISI component included in the first digital signal using 
an equalizer blod^ 
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(d) producing a second digital signal, the second digital signal being substantially 
equal to the first digital signal with the first ISI component substantially compensated; 

(e) compensating flie second ISI component included in the second digital signal; 
(0 producingathiiddigitalrignal,thethiiddigitalsignalbeingsubstantid^^ 

to the second digital signal with the second ISI component substantially compensated; and 
(g) decoding the third digital signal: 

216. The method according to claim 215 wherein operation (c) comprises the 
operations o£ 

filtering the first digital signal using an ISI compensation filter, the ISI 
condensation filter having an impulse response substantially inverse of the impulse response 
of the pulse stuping filter of the remote transmitter; and 

adjusting a gain of the filtered first digital signal using an ads^tive gain stage. 

217. The method according to claim 216 wherem the ISI compensation filter is an 
inverse partial response filter. 

218. The method according to claim 217 wherein the equalizer block is configured 
as a feedfonvard equalizer. 

219. The method according to claim 215 wherein operation (g) comprises: 
genemting tentative decisions from a decode block; and 

combining tentative decisions \^th values of a set of high-ordered coefficients 
in a decision feedbadc equalizer to generate a first portion of ISI comp^isation for the second 
ISI component. 

220. The inethod according to claim 2 19 wherein operation (e) comprises: 
defining a set of low-order coefficients from the decision feedback equalizer; 
combining the set of low-order coefficients with a set of values representing 

levels of a multi-level alphabet so as to generate a set of pre-computed values representing a 
set of potential second portions of ISI compensation for the second ISI component. 

221 . The m^od according to claim 220 \^erein operation (e) finth^ comprises 
the operation of combining the first portion of ISI compensation to the second digital signal 
to generate an intermediate digital signal having the first ISI component substantiaUy 
compensated and the second ISI component partially compensated. 

222. Tlie method according to claim 221 wherein operation (e) further comprises 
tiie opra^on of combining the set of pre-computed values with the intermediate digital signal 
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to define a set of potential digital signals, one of the potential digital signals being 
substantially compensated for both the first and the second ISI components. 

223. The method according to claim 222 wherein operation (e) fiirthcr comprises: 
selecting one of the potential digital signals based on at least one tentative 

decision received &om the decoder block, the selected one of the potential digital signals 
being the third digital signal. 

224. A receiver for demodulating an analog signal transmitted by a remote 
transmitter over a transmission channel, the analog signal including a first ISI component 
induced by a characteristic of a pulse shaping filter included in the remote transmitter and a 
second ISI conq)onent induced by a characteristic of the transmission channel, the receiver 
comprising: 

an analog fiont end including an analog-to-digital converter, the analog front 
end receiving the analog signal and outputting a first digital signal; 

a first ISI compensation circuit receiving the first digital signal &om the analog 
&ont end and outputting a second digital signal, the second digital signal bemg substantially 
compensated for the first ISI component; and 

a second ISI compensation circuit, the second ISI compensation circuit 
receiving the second digital signal and generating a thud digital signal, Ae third digital signal 
being substantially compensated for the second ISI component. 

225. The receiver according to claim 224 wherein the first ISI compensation circuit 
comprises: 

an inverse partial response filter; and 

wherein the inverse partial response filter has an impulse response substantially 
inverse of the impulse response of the pulse shapmg filter of the remote transmitter, so as to 
substantially compensate the first digital signal for the first ISI component. 

226. The receiver according to claim 225 wherein the inverse partial response filter 
is implemented with a diaracteristic feedback gain factor K. 

227. The receive according to claim 226 wherein the inverse partial response filter 
operates in accordance with a non-zero value of the characteristic feedback gain fiictor K 
during communication initialization and wherein the value of the feedback gain fector K is 
ranged down to zero after a pre-defined interval. 

228. The receiver according to claim 227 wherein the second ISI con^iensation 
circuit fiirther comprises: 
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a Viterbi decoder configured to decode the third digital signal and generate 
tentative decisions; and 

feedback equalizer circuitry coupled to the Viterbi decode, the feedback 
equalizer circuitry receiving the tentative decisions and combining the tentative decisions witfi 
a set of high-ordered coefiGcients to generate a first value. 

229. Hie receiver according to claim 228 wherein the second ISI compensation 
circuit fiuther comprises: 

siunming circuitry combining the first value with the second digital signal, the 
summing circuitry outputting an intermediate signal; and 

a multiple decision feedback equalizer receiving the intmnediate signal and 
combining the intermediate signal with a set of pre-computed values generated by combining 
values of a set of low-ordered coefScients with a set of values representing levels of a multi- 
level symbolic alphabet to produce a set of potential digital signals, one of the potential digital 
signals being substantially ISI compensated, the multiple decision feedback equalizer 
outputting said one of the potential digital signals to the Viterbi decoder. 

230. The receiver according to claim 229» wherein the feedback equalizer circuitry 
is a decision feedback equalizer having a multiplicity of coefficients, the low-ordered 
coefficients comprismg the first two coefficients of the multiplicity of coefficients. 

23 L The receiver according to claim 230, wherein the high-ord^^ coefficients 
comprise the remaining coefficients of the multiplicity of coefficients. 

232. The receiver according to claim 23 1 , wherein the characteristic feedback gain 
fector K is ramped to zero after convergence of the decision feedback equalizer. 

233. The receiver according to claim 224 herein the receiver is included in an 
integrated circuit transceiver. 

234. A method for dynamically regulating power consumption of a high-speed 
integrated circuit including a multiplicity of processing blocks, the method comprising the 
operations of: 

defining a first metric relating to a first performance parameter of the integrated 

circuit; 

defining a second metric relating to a second performance parameter of the 
integrated circuit ; 

setting the first metric to a pre-defined value; 

disabling selected ones of the multiplicity of processing blocks in accordance 
with a set of pre-d^emiined pattmis; 
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evaluating tiie second metric, while perfonning the disabling operation, to 
gen^ate a range of values of the second metric, each of said values corresponding to the pre- 
defined value of the first metric; 

identifying a most desirable value of the second metric fiiom the range of said 

values; and 

matching the most desirable value of the second metric to a corresponding one 
of the pre-detOTnincd pattons; 

wherdn the integrated circuit is subsequently operated with selected ones of the multiplicity 
of processing blocks disabled in accordance with the pre-detennined pattern corresponding 
to the most desirable value of the second metric. 

23 5. The method of claim 234, whaein the first and second performance parameters 
are distinct and are chosen fiom the parametric group consisting of power consumption and 
a signal quaUty figure of merit 

236. The method of claim 235 wherein the integrated circuit is a bidirectional 
conmiunication device and whwein the multiplicity of processing blocks comprises multi-tap 
digital filter elements, wherein the disabling operation comprises the operation of adaptively 
disabling taps of one of the multi-tap digital filter elemoits. 

237. The method of claim 235, wherein the integrated circuit is a bidirectional 
conmiunicationdcviceandwhcmnthemultipUcity of processiag blocks comprises a decoder, 
including at least a symbol-by-symbol decode and a trellis decoder, wherein the disabhng 
operation comprises the operation of disabling the trellis decoder. 

238. A method for dynamically regulating power consumption of a commimication 
system including a first module, the method comprising the operations of: 

specifying at least one of a specified pow«^ and a specified error, 
conqjuting at least one of an information error metric and a power metric; and 
controlling activation and deactivation of at least a portion of the first module 
of the communication system according to a criterion, the criterion being based on at least one 
of the information error metric, the powa- metric, the specified ator and the specified power, 
to regulate at least one of the mformation error metric and the power metric. 

239. The method of claim 238 wherein the criterion is die following: 

activate if the information error metric is strictly greater than the specified 

CTPor; and 

deactivate if the information OTor metric is strictly smaller than the specified 

mor. 
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240. The method of claim 238 wherein the criterion is the following: 

activate if the information OTor metric is strictly greatCT than the specified error 
and the power metric is strictly smaller than the specified power; and 

deactivate if the information error metric is strictly smaller than the specified 
error or the power metric is strictly greater than flie specified power. 

241 . The method of claim 238 whereia the criterion is the following: 

activate if flie information error metric is greater than the specified error; and 
deactivate if the information error metric is smaller than a target value, the 
target value being smaller than the specified error. 

242. The method ofclaim 238 wherein the criterion is the following: 

activate if the information error metric is greater than the specified error and 
the power metric is strictly smaller than the specified power, and 

deactivate if fte information error metric is smaller ttian a target value, the 
target value being smaller than ttie specified error, or the power metric is strictly greater than 
the specified power. 

243. Tlie method of claim 238 wherein the Information error metric is related 
to a bit error rate of the communication system. 

244. The method of claim 238 wherein the information error metric is a 
measurement of performance degradation of the communication system caused by 
deactivation of the portion of the first module of the communication system. 

245. The method ofclaim 238 wherein the communication system comprises 
a gigabit transceiver, the gigabit transceiver including a set of constituent transceivers. 

246. The method of claim 238 wherein the first module is a filter included in 
one of the constituent transceivers of the gigabit transceiver. 

247. The method of claim 246 herein the filter comprises a set of taps, each 
of the taps including a filt^ coefficient 

248. The method of claim 247 wherein the information error metric is a 
measurement of performance degradation of the gigabit transceivCT caused by the filter. 

249. The m^od of claim 247 wherein eadi of the constituent transceivers 
includes a plurality of filters and wherein the information OTor metric is a measuronent 
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of perfonnance degradation of the gigabit transceiver caused by a plurality of filters 
included in a respective one of the constituent transceivers. 

250. The method of claim 247 wherein each of the constituent transceiv^ 
includes a plurality of filters and wherein the information error metric is a measurement 
of performance degradation of flie gigabit transceiver caused by filters included in all of 
the constituent transceivois. 

251. The method of claim 247 wharein the information error metric is a 
measurement of a change in performance degradation of the gigabit transceiver caused 
by the filter. 

252. The method of claim 247 wherein each of the constituent transceivers 
includes a plurality of filters and x^erein the information error metric is a measurement 
of a change in perfonnance degradation of the gigabit transceiver caused by a plurality 
of filters included in a respective one of the constituent transceivers. 

253. Hie method of claun 247 wherein each of the constituent transceivers 
includes a plurality of filters and whoiein the information oror metric is a measurement 
of a change in performance degradation of the gigabit transceiver caused by filters 
included in all of the constituent transceivers, 

254. The method of claim 245 wherein the first processing module is a trellis 
decoder included in one of the constituent transceivers of the gigabit transceiver. 

255. The method of claim 245 wherein the first processing module is a symbol- 
by-symbol decode* included in one of the constituent transceivers of the gigabit 
transceiver. 

256. A system for djmamically regulating power consumption of a 
communication system, the communication system including a first module, the system 
regulating at least one of a powa: metric and an information error metric based on at least 
one of a specified power and a specified raror, the system comprising: 

a control module controlling activation and deactivation of at least a portion of 
the first module of the commimication system according to a CTiterion, the criterion being 
based on at least one of the information error metric, the power metric, the specified error 
and fhe specified power, and 

a computing module coiq)led to the control module, the computing module 
computing at least one of the information aror metric and the power metric. 
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The system of claim 256 wherein the criterion is the following: 
activate if the infonnation error metric is greater than the specified error; 

deactivate if the infonnation error metric is smaller tiian the specified 

258. The system of claim 256 wha^in the criterion is the following: 
activate if the information error metric is greater than the specified error 

and the power metric is smaller than the specified power; and 

deactivate if the iiifpnriationerrormetric is sinaller than thcspecified error 
or the power metric is greater flian the specified power. 

259. The system of claim 256 wherein the infonnation error metric is related 
to a bit error rate of the communication system. 

260. The system of claim 256 wherein the infonnation error metric is a 
measurement of performance degradation of the commmiication system caused by 
deactivation of a portion of the first module of the communication system. 

26 L The system of claim 256 wherein the commimication syst^ comprises 
a gigabit transceiver, the gigabit transceiver including a set of constituent transceivers. 

262. The system of claun 261 wherem the first module is a filter included in 
one of the constituent transceivers of the gigabit transceiver. 

263. The ^stem of claim 262 wherein the filter comprises a set of taps, each 
of the taps including filter coefficient 

264. The system of claim 263 wherein the information error metric is a 
measurement of performance degradation of the gigabit transceiver caused by the filter. 

265. The system of claim 263 wherein the information error metric is a 
measurement of p^ormance degradation of the gigabit transceiver caused by all filters 
in said one of the constituent transceiv^. 

266. The system of claim 263 wherein ttie information error metric is a 
mrasurement of performance degradation of the gigabit transceiver caused by all filters 
in the constituent transc^ets. 
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267. The system of claim 263 wherein the infomation error metric is a 
measurement of a change in performance degradation of the gigabit transceiver caused 
by the filter. 

268. The system of claim 263 wherein the information error metric is a 
measurement of a change in performance degradation of the gigabit transceiver caused 
by all filters in the one of the constituent transceivers. 

269. The system of claim 263 ^^*erein the information error metric is a 
measurement of a change ui performance degradation of the gigabit transceiver caused 
by all filters in the constituent transceivers. 

270. The system of claim 261 wherein the first module is a trellis decoder 
included in one of the constituent transceivers of the gigabit transceiver. 

271- The system of claim 261 wherein the first module is a symbol-by-symbol 
decoder included in one of ttie constituent transceivers of the gigabit transceiver. 

272. A method for reducing power consumption of an integrated circuit digital 
filter, the filter having an initial set of active coefiScients, an input and an output, the 
active ooefQcients being ordered, a lowest ordered active coefficient being proximal to 
the input, each of the active coefBcients having a stable value, the method comprising the 
operations of: 

(a) detemuning a threshold value; 

(b) comparing an active coefficient with the threshold value; and 

(c) deactivating the active coeffici^t if a criterion is satisfied. 

273. The method of clairn 272 finther con^rising the operation of: 

(d) repeating the operations of comparing and deactivating for other active 
coefficients of the initial set 



274. The metiiod of claim 273 fiuther conq)rising the operations of: 

(e) providing a specified error, 

(f) computing an error me^c; 

(g) iqxiating the threshold if a first criterion is satisfied; and 

(h) iqieating op^tions (b), (c) and (d) if the threshold is updated 



275. The m^od of claim 274 wherein oparation (g) comprises the opCTation 
of increasing the threshold if the error metric is smaller than the specified error. 
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276. The method of claim 274 wherein opeiatioii (g) comprises the operation 
of decreasmg the threshold if the error metric is larger than the specified enror. 

277. The method of claim 273 further comprising the opCTations ofi 
(e) providing a specified error and a specified power; 

(0 computing an error metric and a power metric; 

(g) updating the threshold if a first criterion is satisfied; and 

(h) repeating opCTations(b), (c) and (d) if the threshold is updated, 

278. The method of claim 277 wiiCTein operation (g) comprises the operation 
of increasing the threshold if the error metric is smaller than the specified error or the 
power metric is larger than the specified power. 

279. The method of claim 277 wherein operation (g) comprises the operation 
of decreasing the threshold if the error metric is larger than the specified error and the 
power metric is smaller than the specified power, 

280. The method of claim 272 wherein the criterion in operation (c) is satisfied 
if the active coeflSdent has a stable value smaller than the threshold. 

28 1 . The method of claim 272 wherein operation (a) comprises the operation 
of equating the threshold to a minimum of the stable values of the active coefficients, 

282. Tlie method of claim 272 wherein operation (a) comprises the operation 
of equating the threshold to a simulation test result. 

283. The method of claim 274 wherein the error metric is a mean squared error. 

The method of claim 274 wherein operation (f) comprises the operations 

squaring the stable values of deactivated coefficimts; 
surmning the squared stable values to produce a sum; and 
equating the enor metric to the sum of the squared stable values. 

285. The method of claim 274 wherein the specified error corresponds to a 
mean squared error to signal ratio of approximately -24 dB. 

286. The method of claim 272 wherein the filter is configured to substantially 
attenuate an echo signal included in a received signal. 



284. 

of: 
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287. The method of claim 272 wherein the filter is configured to substantially 
attenuate a near-end crosstalk signal included in a received signal. 

288. Hie method of claun 272 whaem the filter is configured to substantially 
attenuate a fia--cnd crosstalk signal included in a received signal. 

289. The method of claim 272 whandn the filter is configured to receive a 
signal including symbols at a symbol rate of about 125 megaHertz, each of the symbols 
including three bits. 

290. The method of claim 289 whoein the filter is mcluded in a transceiver, 
flie transceivo: being included in a single-chip gigabit transceiver. 

291. The method of claim 272 wherein the filter has substantially more than 
120 ordered active coeflBcients, 

292. The method of claim 272 fiirther comprising the operation of repeating 
operations (b) and (c) for the remaining active coefficiwits of the initial set in asequential 
order according to the order of the active coefficients. 

293. The method of claim 272 wherein N consecutive active coefficients 
including the lowest ordered coefficient are exempt fiom deactivation. 

294. The method of claim 272 fiirther comprising the operation of training the 
active coefficients of the initial set such fliat the active coefficients converge to the stable 
values. 



295. The method of claun 294 wherein the active coefficimts of the initial set 
are trained with a least mean squares algorithm. 

296. The method of claim 294 wherein the operation of training ttie active 
€x>efficients of the initial set comprises the operations of: 

(1) training Nl consecutive ordered active coefficients including the lowest 
ordered active coefficient such that the Nl active coefficients conv^ to first values; 

(2) training N2 consecutive active coefficimts, including the Nl active 
coefficients, such that flie N2 active coefficirats converge to second values, N2 being a 
numba: greater than Nl; and 

(3) repeating opaation (2) with N2 substituted by N3, N3 being a number 
greater than N2, until the active coefficients of the initial set converge to the stable 
values. 
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297. The method of claim 272 further comprising the operations of: 
activating a deactivated coefiBcient of the filter; 

training the activated coefficient such ^t the activated coefficient 
converges to a stable value; 

con^aring the activated coefficient with the threshold; and 
deactivating the activated coefficient if the criterion is satisfied. 

298. The method of claim 274 wh^ein the filter is a finite-impulse-response 
filter, the filter including a set of delay line sections, the delay line sections corresponding 
one-to-one with the active coefficients of the initial set, except for the lowest ordered 
coefficient of the initial set 

299. The metiiod of claim 298 fiirther comprising the operation of: 
deactivating delay line sections corresponding to deactivated coefficients 

that have higher orders than a highest ordered non-deactivated coefficient 

300. A method for reducing power consunq)tion of a filter, the filter having an 
initial set of active coefficients and an iiqsut, the active coefficients being ordered, a 
lowest ordered active coefficient being proximal to the input, the method comprising the 
operations of: 

(a) determining a threshold; 

(b) providing a specified error; 

(c) initializing the active coefficients by assigning an initial value to each of 
the active coefficients; 

(d) training Nl consecutive active coefficients including the lowest ordered 
active coefficient such that the Nl active coefficients converge to first values; 

(e) reducingtheNl active coefficients to Ml active coefficients based on the 
threshold. Ml being a number smaller or equal to Nl ; 

(f) training N2 consecutive active coefficients including the Ml active 
coefficients such that the N2 active coefficimts conve]:ge to second values, N2 being a 
number greatCT than Ml; and 

(g) rq>eating op^ations (e) and (f) withNl substituted by N2, Ml substituted 
by M2, M2 being a number smalls or equal to N2, andN2 substituted by N3, N3 being 
a numb^ greato: than M2, until the set of active coefficients is reduced to a subset of 
active coefficients having stable values. 

301 . The method of claim 300 wherein oporation (e) comprises the opemtions 

of: 

con^)aring an active coefficient with the tiireshold; and 
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deactivating the active coefiBcient if a criterion is satisfied for the active 
coefiScicnt. 

302. The method of claim 301 wherein the criterion is satisfied if the active 
coefficient has a value smaller than the threshold. 

303. A system for reducing power consumption of a filter, the filter having an 
initial set of active coefficients, an itqiut and an output, the active coefficients being 
ordered, a lowest ordered active coefficient of the initial set being proximal to the input, 
each of the active coefficients having a stable value, the system comprising: 

a threshold module generating a flueshold; 

a comparing module coiq)Ied to the threshold module, the comparing 
module comparing an active coefficient with the threshold; and 

a decision module coupled to the comparing module, the decision module 
deactivatuig die active coefficient according to a criterion. 

304. The system of claim 303 wherein the decision module deactivates the 
active coefficient if the active coefficient has a value smaller than flie threshold. 

305. TTic s>^tem of claim 303 fiirtha: comprising: 
a buflFer providing a specified enx)r, 

an ciroT conqsuting module computing a error metric; and 
a second comparing module coupled to the buffer, flie error computing 
module and the threshold module, the second comparing modxde comparing the error 
metric with the specified error and producing a first control signal to the threshold 
module when the error metric is smaller than the specified error and a second control 
signal to the threshold module vAim the error metric is larger than the specified error. 

306. The system of claim 305 wherem the threshold modxile updates the 
threshold upon recq)tion of the first or second control signal. 

307. Amethod for reducing system performance degradation due to switching 
noise in a system, the system comprising a set of subsystems, each of the subsystems 
conqirising an analog section and a digital section, each of the analog sections operating 
in accordance with a corresponding one of a set of sampling clock signals, the sampling 
clock signals being synchronous in firequency, the digital sections operating in 
accordance with a receive clock signal, the method comprising the operations of: 

generating the receive clock signal such that the receive clock signal is 
synchronous in firequmcy with the sampling clock signals and having a phase ofi&et with 
respect to one of the sampling clock signals; and 
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adjusting the phase ofiEset such that system performance degradation due 
to coupling of switching noise from the digital sections to the analog sections is 
substantially minimized 

308. The method of claim 307 who-ein, in the operation of adjusting the phase 
oflBset, the phase of&et is adjusted such that a time difference between a transition 
occurrence of the receive clock signal and transition occurrences of sampling clock 
signals, that arc adjacoit in time to the transition occurrence of the receive clock signal, 
is substantially maximized. 

309. The method of claim 307 wherein the operation of adjusting the phase 
ofiset of the receive clock comprises the operations of: 

(1) determimng a set of phase ofl&et values for the phase offeet; 

(2) computing a set of system performance errors corresponding one-to-one to the 
phase offiet values; and 

(3) selecting one of the phase of&et values, said one phase ofi&et value 
corresponding to a minimum of the systan performance errors. 

310. The method of claim 3 09 wherein the set of phase of&et values comprises 
64 phase ofi&et values. 

311. The method of claim 309 wherein operation (2) comprises the operations 

of: 

computing a subsystem performance error, corresponding to one of the 
phase of&et values, for each of the subsystems; 

combining tiie subsystem paibrmance errors to generate the 
corresponding syst^ performance error. 

3 12. The method of claim 31 1 wherein the operation of computing asubsystem 
performance error for a corresponding subsystem comprises: 

squaring a slicer error associated with the subsystem; 

accumulating a numb^ of associated squared sheer errors via a filter for 
a period of time; and 

ou^utting an accumulated squared enx»r as the subsystem performance 
mor after ttie period of time. 

313. The method of claim 307 fiiitha: comprising the operation of 
adjusting a sanq)ling phase of at least one of the sampling clock signals 

such that a subsystem performance error of the subsystem which corresponds to said one 
of the sampling clodc signals is substantially minimized. 
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314. The method of claim 313 wherein the operation of adjusting the sanq)ling 
phase of at least one of the sampling clock signals comprises the operations of: 

(1) detennining a set of sanqiling phase values for the sampling phase; 

(2) conq>uting a set of subsystem perfomiance errors corresponding one- 
to-one to the sampling phase values; and 

(3) selecting one of the sampling phase values, said one sampling phase 
value corresponding to a mmunum of the subsystem performance errors. 

315. The method of claim 314 wherein the set of sampling phase values 
conq>rises 16 sampling phase values. 

316. The method of claim 3 14 whereinthe operation of computing a subsj^stem 
performance error for the corresponding subsystem comprises: 

squaring a slicer error associated with the subsystem; 

accimiiilating a niunber of associated squared slicer errors via a filter for 
a period of time; and 

outputting an accumulated squared error as the subsystem performance 
error after the period of time. 

317. The method of claim 307 further comprising the operation of: 
adjusting a sampling phase of each of the sampling clock signals such that 

a subsystem performance error of a corresponding subsystem is substantially minimized. 

318. A method for reducing effect of switching noise in a system, the s>^em 
comprising a set of subsystems, each of the subsystems comprising an analog section and 
a digital section, each of the analog sections operating in accordance with a 
corresponding one of a set of sampling clock signals, the digital sections operating in 
accordance with a receive clock signal, the method comprising the operations of: 

generating the san:q)ling clock signals such that the sampling clock signals 
are synchronous in finequency with each other; 

generating the receive clock signal such that the receive clock signal is 
synchronous in fi^quency with &e sanq)ling clock signals and having a phase o&et with 
respect to one of the sampling clock signals; and 

adjusting the phase o£&et such that effect of switching noise fix>m the 
digital sections on the analog sections is substantially minimized. 

319. Hie method of claim 318 wherein, in the op^ation of adjustiag the phase 
ofi&et, fho phase ofiGset is adjusted such that time difference between a transition 
occurrence of the receive clock signal and transition occurrences of saropling clock 



"132. 



wo 00/28691 



PCT/US99/26493 



signals that are adjacent in time to the transition occurrence of the receive clock signal 
is substantiaUy maximized 

320. The method of claim 318 fiirthCT comprising the operation of. 
adjusting a phase of at least one of the san5)ling clock signals such that 

a subsystem performance OTor of the subsyst^ which corresponds to said one of the 
sampling clock signals is substantially minimized. 

321. The method of claim 318 wherein the operation of generating the sampling 
clock signals comprises the opoations of: 

(a) generating a phase error for each of the sampling clock signals fiom a 
corresponding phase detector, 

(b) inputting eadi of the phase errors to a coiresponding loop filter; 

(c) generating filtered phase errors fiom the corresponding loop filters; 

(d) ii9utting each of the filtered phase errors to a corresponding oscillator, 

(e) generating phase control signals bom the corresponding oscillators; 
(^inputting each of the phase control signals to a corresponding phase 

selector; and 

(g)generating the sampling clock signals fix>m the corresponding phase 

selectors. 

322. The mefliod of claim 321 wherein the operation of generating the receive 
clock signal comprises the operations of: 

(1) combining one of flie phase control signals with the phase oflfeet to 
piroduce a phase shift value; 

(2) iiq)uttmg the phase shift value to a receive clock phase selector, and 

(3) generating the receive clock signal firom the receive clock phase 

selector. 

323. The method of claim 322 wherein the phase shift value comprises a set of 
phase steps and wherein operation (2) comprises the opoation of inputting the phase 
steps consecutively to flie receive clodc phase selector. 

324. The method of claim 322 whorein the ope^on of adjusting tire phase 
ofiGset of the receive clock comprises tfie operations o£ 

(4) determining a set of phase ofi&et values for the phase of&et; 

(5) computing a set of systan pCTformance errors corresponding one-to- 
one to the phase o£&et values; and 

(6) selecting one of the phase of&et values, said one phase ofi^ value 
corresponding to a minimum of the system performance OTors. 
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325. The method of claim 324 wherein the set of phase offset values comprises 
64 phase ofi&et values. 

326. The method of claim 324 wherein op^ation (5) comprises the operations 

of: 

computing a subsystem performance error for each of the subsystems for 
one of the phase o&dt values; 

combining the sub^strai performance errors to generate the corresponding system 
performance mor. 

327. The method ofclaim 326 wherein the operation ofcomputing a subsystem 
performance error for a corresponding subsystem comprises: 

squaring a slicer enor associated with the subsystem; 
accumulating a number of associated squared slicer errors via a filter for 
a period of time; and 

outputting an accumulated squared error as the subsystem performance 
error after the period of time. 

328. The m^od of claim 321 \\4ierein, in operation (a), each of the phase 
detectors receives a corresponding sheer oror and a corresponding tentative decision 
ftom a decoding sjrstem. 

329. The method ofclaim 328 wherein operation (a) comprises: 

(1) generating a pre-cursor phase error by multiplying the corresponding 
tentative decision by a delayed version of ttie corresponding slicCT error; 

(2) generating a post-cursor phase error by multiplying the corresponding 
sheer error by a delayed version of the corresponding tentative decision; and 

(3) combining the pre-cursor and post-cursor phase errors to produce the 
corresponding phase error. 

330. The mettiod of claun 329 wherein operations (1), (2) and (3) are 
p^onned via a lattice stmcture, the lattice structure comprising two delay elements, two 
multiplier and an addo*. 

33 1 . The method ofclaim 330 wha-ein operation (3) includes the operation of 
combining the pre-cursor, post-cursor phase errors and an oflfeet mput from a control unit 
to produce the corresponding phase error. 

332. TTie method ofclaim 33 1 wherein opa:Btion (c) comprises: 
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accumulating a number of consecutive values of one of the phase errors 
via a first filter, resulting in a sum value; 

ou^utting the sum value fiom the first filter; 

integratmg the sum value via a second filta: to produce an integral value; 

and 

combining the sum value and the integral value to produce a filtered phase 

enor. 



333. The method ofclaim 332 vdiOTcin operation (3) includes the operation of 
scaling the integrated sum value by a scale factor to produce the integral value. 

334. The method of claim 332 wh^ein operation (c) fiulher comprises, before 
operation (3), the operation of multiplying the sum value by a fector different than 1 
when the system is operating in a different bandwidth mode. 

335. The method of claim 32 1 wherein op^tion (e) comprises the operation 
of filtering recursively the filtered phase errors to produce the corresponding phase 
control signals. 

336. Hie method of claim 335 wherein operation (e) fiulher comprises the 
operation of scaling, before filtmng recursively, the filtered phase errors by a scale 
&ctor. 

337. The method of claim 321 wherein operation (g) comprises the operations 

o£ 

inputting a multi-phase input signal fiiom a clock genemtor to each of ttie 
phase selectors; and 

selecting at each of the phase selectors one of the phases of the multi- 
phase iiq>ut signal based on the phase control signal received bom the corresponding 
oscillator. 



338. Amethod for gmerating aset of clock signals in a system, the sei of clock 
signals conqmsing a set of san^)ling clock signals, the system comprising a set of 
subsystems, each of the subsystons comprising an analog section, each of the analog 
sections opiating in accordance with a coiiesponding one of the sanqiling clock signals, 
the method comprising the operations of: 

gen^ating a phase earor for each of the sampling clock signals fiom a 
corresponding phase detector, 

ii^utting each of the phase errors to a corresponding loop filter, 
gmeiBting filtered phase mors fiom the corresponding loop filters; 
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selector; and 
selectors. 



inputting each of the filtCTed phase errors to a corresponding oscillator, 
generating phase control signals from the conresponding osciUatore; 
inputting each of the phase control signals to a corresponding phase 

generating the sampling clock signals from the correspondmg phase 



339. He method of claim 338 wherein the set of clock signals fiirther 
comprises a receive clock signal and wherein each of the subsystems fiirther comprises 
a digital section, the digital sections operating in accordance with the receive clock 
signal. 

340. Themethodofclaim339whereinthereceiveclocksignalisrelatedtoone 
of the sarr^ling clock signals. 

341 . The method of claim 340 fiirther con5)rising the operations of: 
combining one of tiie phase control signals with a receive clock oflfeet to 

produce a phase shift value; 

inputting the phase shift value to a receive clock phase selector, and 
generating the receive clock signal from the receive clock phase selector. 

342. The method of claim 341 wherein flie phase shift value coirprises a set of 
phase steps and wherein the inputting operation comprises the operation of inputting one 
phase step of the phase shift value at a time to the receive clock phase selector. 

343. The method of claim 338 wherein the set of clock signals finther 
comprises a transmit clock signal and \^iiercin each of the subsystems fiuther comprises 
a transmit section, the transmit sections operating in accordance with the transmit clock 
signal. 

344. The method of claim 343 fiirther con5)rising the operations o£ 
iiqiutting a transmit clock oflEset to a transmit clock phase selector, and 
generatrngthetranmiitclocksigndfiomthetransmitclockphaseselector. 

345. The method of claim 344 wherein the transmit clock oflfeet is equal to 

zero. 

346. The melhod of claim 343 wherein the transmit clock signal is related to 
one of the sampling clodc signals. 
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347. The method of claim 346 further con^rising the operations of: 
inputting one of the phase control signals to a transmit clock phase 

selector, and 

generating ttie transmit clock signal from the transmit clock phase selector 

348. The method of claim 338 wherein each of the phase detectors receives a 
corresponding sheer error and a corresponding tentative decision from a decoding system. 

349. The method of claim 348 wherem operation (a) comprises: 

(1) generating a pre-cursor phase error by multiplying the corresponding 
tentative decision by a delayed version of the corresponding sheer error; 

(2) generating a post-cursor phase errorby multiplying the corresponding 
sheer error by a delayed version of the corresponding tentative decision; 

(3) combining the pre-cursor and post-cursor phase errors to produce the 
connesponding phase error. 

350. The method of claim 349 wherein operations (1). (2) and (3) are 
performed via a lattice structure, the lattice structure comprismg two delay elements, two 
muItipUers and an adder. 

351. The method of clakn 349 wherein operation (3) mcludes the operation of 
combining the pre-cursor, post-cxnsor phase errors and an offset input from a control imit 
to produce the corresponding phase error. 

352. The method of claim 338 wherein operation (c) comprises: 

(1) accumulating a number of consecutive values of one of the phase 
errors via a first filta-, resulting in a sum value; 

(2) outputting the sum value from die first filter; 

(3) integrating the sum value via a second filter to produce an integral 

value; and 

(4) combinmg the sum value and the integral value to produce a filtered 

phase error. 

353. The method of claim 352 ^aiein op^ation (3) includes the operation of 
seating the integrated sum value by a scale factor to produce flie integral value. 

354. The method of claim 352 wherein operation (c) furtfier comprises, before 
operation (3), the operation of multiplying the sum value by a fector diff^ent than 1 
when tiie system is operating in a different bandwidth mode. 
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355. The method of claim 338 wherein operation (e) comprises the operation 
of filtering recursively the filtered phase errors to produce the corresponding phase 
control signals. 

356. The me&od of claim 355 wherein operation (e) further comprises the 
operation of scaling, before filtering recursively, the filtered phase errors by a scale 
fector. 

357. The method of claim 338 wherein operation (g) con^rises the operations 

of: 

(1) inputting a multi-phase input signal Scorn a clock generator to each of 
the phase selectors; and 

(2) selecting at each of the phase selectors one of the phases of the multi- 
phase input signal based on the phase control signal received fix)m the corresponding 
oscillator. 

358. A timing recovery system for generating a set of clock signals in a 
processing system, the set of clodc signals comprising a set of sampling clock signals, 
the processing system comprising a set of processing subsystems, each of the processing 
subsystems co^^>rising an analog section, each of the analog sections opiating in 
accordance with a corresponding one of the sampling clock signals, the timing recovery 
system comprising: 

(a) a set of phase detectors generating phase errors for the corresponding 
sampling clock signals; 

(b) a set of loop filters coupled to the corresponding phase detectors, the 
loop filters receiving the corresponding phase errors and generating filtered phase errors; 

(c) a set of oscillators coiq>led to the corresponding loop filters, the 
oscillators recdving the filtered phase errors and generating phase control signals; and 

(d) a set of phase selectors coupled to the corresponding oscillators, flie 
phase selectors receiving the phase control signals and generating the sampling clock 
signals. 

359. The timing recovery system of claim 358 wherein the set of clock signals 
fiirther comprises a receive clodc signal and wherein each of the processing subsystems 
finther con:q)rises a digital section, the digital sections operating in accordance with the 
receive clock signal 

360. The timing recovoy system of claim 359 \^dierein the recdve clodc signal 
is related to one of the sampling clock signals. 
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361. The tiiniiigre<»v<ay system ofclaim 360 further comprism^ 

and a receive clock phase selector, the first adder receiving one of the phase control 
signals and a receive clock ofl&et and generating a phase shift value, the receive clock 
phase selector receiving the phase shift value and generating the 'receive clock signal. 

362. The timing recovery system of claim 361 wherein the phase shift value 
comprises a set of phase steps and wherein the receive clock phase selector receives the 
phase shift value in the form of consecutive phase steps, 

363. The timing recovery system of claim 358 wherein the set of clock signals 
further comprises a transmit clock signal and wherein each of the subsystems further 
comprises a transmit section, the transmit sections operating in accordance with the 
transmit clock signaL 

364. The timing recovery system of claim 363 ftirther comprising a transmit 
clock phase selector, the transmit clock phase selector receiving a transmit clock offeet 
and generating die transmit clock signal. 

365. The timing recovery system of claim 364 wherein the transmit clock 
o&et is equal to zero. 

366. The timing recovery system of claim 363 wherein the transmit clock 
signal is related to one of the sampling clock signals. 

367. The timing recovery system of claim 366 fiirther comprising a transmit 
clock phase selector, the transmit clock phase selector receiving one of the phase control 
signals and generating the transmit clock signal. 

368. The timing recovery system of claim 358 wherein each of the phase 
detectors receives a corresponding slicer error and a corresponding tentative decision 
fiom a decoding ^stem. 

369. The timing recovery system of claim 368 wherdn each of the phase 
detectors comprises a lattice stmcture, the lattice structure comprising two delay 

elements, two multipUers and anadder,thelatticestructuregeneratingapre-cursorphase 
aior by multiplying the corresponding tentative decision by a delayed version of the 
corresponding sheer error and genraating a post-cursor phase error by multiplying the 
corresponding sheer error by a deUyed version of the corresponding tentative decision 
and combining the pre-cursor and post-cursor phase errors to produce the corresponding 
phase error. 
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370. ThetimingrecoveiysystemofcIaim369wherematleastoneofthephase 
detectors fiirthcr receives an offeet input fiom a control unit and wherein the associated 
lattice structure combines the pre-cursor, post-cursor phase errors and the ofiset input to 
produce flie corresponding phase enor. 

371. Thetimingrccoveiysystemof claim 358 wherein at least one of the loop 
filters comprises a first filter for accumulating a number of consecutive values of one of 
the phase enors to produce a filtered phase error. 

372. The timing recoveiy system of claim 358 wherein at least one of the loop 
filtersconqjrises afirstfiltra-fiffaocumulatrnganumberof consecutive valucsofoneof 
file phase errors to produce a sum value, a second filter for integrating the sum value to 
produce an integral value and an adder for combining the sum value and the integral 
value to produce a filtered phase error. 

373. The timing recoveiy system of claim 372 wherein the second filter 
includes a multiplier for scaling the integrated sum value by a scale fiujtor to produce the 
integral value. 

374. The timing recovery system of claun 372 wherein at least one of the loop 
filtcas fiirther comprises a multipUer for multiplying the sum value by a fector different 
than 1 when the system is operating m a different bandwidth mode. 

375. The timing recovery system of claim 358 wherein each of the oscillators 
comprises an infinite impvlse response filter for filtering recursively the filtered phase 
CTors to produce the corresponding phase control signals. 

376. Tlietimmgrecovery system of claim 375 wherein at least one of the 
oscillators fiirther comprises a multipUer for scaling the filtered phase errors by a scale 
fiictor and oulputting the scaled filtered phase errors to the associated impulse response 
filter. 



377. The timing recovery system of claim 358 wherein each of the phase 
selectors receives a multi-phase input signal fiom a clock generator and selects one of the 
phases of the multi-phase input signal based on the phase control signal received fiom 
the carrraponding oscillator. 

378. A timing recoveiy system for generating a set of clock signals in a 
processing system, the set of clock signals comprising a set of sampling clock signals, 
the processing system conqjrisingasetofprocessing subsystems, each of the processing 
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subsystems comprising an analog section, each of the analog sections operating in 
accordance with a conesponding one of the sampling clock signals, the timing recovery 
syston conq)rising: 

(a) a set of phase detectors generating phase acrors for the corresponding 
sampling clock signals; 

(b) a set of loop filt«s coupled to tiie corresponding phase detectors, tiie 
loop filters receiving the corresponding phase errors and generating filtered phase errors; 

(c) a set of digital-to-analog (D/A) converters coupled to the loop filters, 

theD/Aconverters receiving thefilteredphase errors and generating analogfiltered phase 
enors; and 

(d) a set of oscillators coupled to the corresponding D/A converters, the 
oscillators receiving the analog filtered phase errors and generating the sampling clock 
signals. 

379. Thetimingrecoverysystemofclaim 158 wherein the oscillators comprise 
varactor diodes. 
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